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Abstract. This review is devoted to the study of stationary solutions of lin- 
ear and nonlinear equations from relativistic quantum mechanics, involving the 
Dirac operator. The solutions are found as critical points of an energy func- 
tional. Contrary to the Laplacian appearing in the equations of nonrelativistic 
quantum mechanics, the Dirac operator has a negative continuous spectrum 
which is not bounded from below. This has two main consequences. First, the 
energy functional is strongly indefinite. Second, the Euler-Lagrange equations 
are linear or nonlinear eigenvalue problems with eigenvalues lying in a spectral 
gap (between the negative and positive continuous spectra). Moreover, since 
we work in the space domain R 3 , the Palais-Smale condition is not satisfied. 
For these reasons, the problems discussed in this review pose a challenge in the 
Calculus of Variations. The existence proofs involve sophisticated tools from 
nonlinear analysis and have required new variational methods which are now 
applied to other problems. 

In the first part, we consider the fixed eigenvalue problem for models of a 
free self-interacting relativistic particle. They allow to describe the localized 
state of a spin-1/2 particle (a fermion) which propagates without changing 
its shape. This includes the Soler models, and the Maxwell-Dirac or Klein- 
Gordon-Dirac equations. 

The second part is devoted to the presentation of min-max principles al- 
lowing to characterize and compute the eigenvalues of linear Dirac operators 
with an external potential, in the gap of their essential spectrum. Many con- 
sequences of these min-max characterizations are presented, among them a 
new kind of Hardy-like inequalities and a stable algorithm to compute the 
eigenvalues. 

In the third part we look for normalized solutions of nonlinear eigenvalue 
problems. The eigenvalues are Lagrange multipliers, lying in a spectral gap. 
We review the results that have been obtained on the Dirac- Fock model which 
is a nonlinear theory describing the behavior of N interacting electrons in an 
external electrostatic field. In particular we focus on the problematic definition 
of the ground state and its nonrelativistic limit. 

In the last part, we present a more involved relativistic model from Quan- 
tum Electrodynamics in which the behavior of the vacuum is taken into ac- 
count, it being coupled to the real particles. The main interesting feature of 
this model is that the energy functional is now bounded from below, providing 
us with a good definition of a ground state. 
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Introduction 

In this paper, we present various recent results concerning some linear and non- 
linear variational problems in relativistic quantum mechanics, involving the Dirac 
operator. 

Dirac derived his operator in 1928 [44], starting from the usual classical expres- 
sion of the energy of a free relativistic particle of momentum p G M 3 and mass 
m 

(1) E 2 =c 2 \p\ 2 + m 2 c A 

(c is the speed of light), and imposing the necessary relativistic invariances. By 
means of the usual identification 

p < — > —ihV 

where h is Planck's constant, he found that an adequate observable for describ- 
ing the energy of the free particle should therefore be a self-adjoint operator D c 
satisfying the equation 

(2) (D c f = -c 2 H 2 A + m 2 c 4 . 
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Taking the locality principle into account, Dirac proposed to look for a local oper- 
ator which is first order with respect to p — —ihV: 

3 

(3) D c = —ich a. ■ V + mc 2 (3 = —icH ctkdk + mc 2 (i 1 

fe=i 

where cti, oti, 03 and (3 are hermitian matrices which have to satisfy the following 
anticommutation relations: 

{a k on -I- ae au = 2<^1, 
a k p + f3a k = 0, 
f3 2 = 1. 

It can be proved [159] that the smallest dimension in which (J4j) can take place is 4 
(i.e. eti, 012, «3 and fj should be 4 x 4 hermitian matrices), meaning that D c has 
to act on L 2 (M. 3 , C 4 ). The usual representation in 2 x 2 blocks is given by 

where the Pauli matrices are defined as 

l\ fO -i\ fl 



° = \\ 0)' <72= { l o)> CT3= vo -1 

By ([l]), the time-dependent Dirac equation describing the evolution of a free 
particle is 

o 

(5) ih— * = D c *. 
w dt 

This equation has been successfully used in physics to describe relativistic particles 
having a spin 1/2. 

The main unusual feature of the Dirac equation is the spectrum of D c which is 
not bounded from below: 

(6) cr(D c ) — (—00, -mc 2 ] U [mc 2 , oo). 

Compared with non-relativistic theories in which the Schrodinger operator — A/(2m) 
appears instead of D c , property ([6]) leads to important physical, mathematical and 
numerical difficulties. Indeed, if one simply replaces —A/(2m) by D c in the ener- 
gies or operators which are commonly used in the non-relativistic case, one obtains 
energies which are not bounded from below. 

Although there is no observable electron of negative energy, the negative spec- 
trum plays an important role in physics. Dirac himself suspected that the negative 
spectrum of his operator could generate new interesting physical phenomena, and 
he proposed in 1930 the following interpretation [151 141)1 Wf\ : 

"We make the assumption that, in the world as we know it, nearly 
all the states of negative energy for the electrons are occupied, with 
just one electron in each state, and that a uniform filling of all the 
negative-energy states is completely unobservable to us." [4"7] 
Physically, one therefore has to imagine that the vacuum (called the Dirac sea) is 
filled with infinitely many virtual particles occupying the negative energy states. 
With this conjecture, a real free electron cannot be in a negative state due to the 
Pauli principle which forbids it to be in the same state as a virtual electron of the 
Dirac sea. 
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With this interpretation, Dirac was able to conjecture the existence of "holes" in 
the vacuum, interpreted as "anti-electrons" or positrons, having a positive charge 
and a positive energy. The positron was discovered in 1932 by Anderson 3J. Dirac 
also predicted the phenomenon of vacuum polarization: in the presence of an electric 
field, the virtual electrons are displaced, and the vacuum acquires a nonconstant 
density of charge. All these phenomena are now well known and well established 
in physics. They are direct consequences of the existence of the negative spectrum 
of D c , showing the crucial role played by Dirac's discovery. 

Actually, in practical computations it is quite difficult to deal properly with the 
Dirac sea. As a consequence the notion of "ground state" (state of "lowest energy" 
which is supposed to be the most "stable" for the system under consideration) 
is problematic for many of the models found in the literature. Numerically, the 
unboundcdness from below of the spectrum is also the source of important practical 
issues concerning the convergence of the considered algorithms, or the existence of 
spurious (unphysical) solutions. 

Dirac's interpretation of the negative energies will be an implicit assumption in 
all this review in the sense that we shall (almost) always look for positive energy 
solutions for the electrons. In the last section, we present a model from Quantum 
Electrodynamics (QED) in which the nonlinear behavior of the Dirac sea will be 
fully taken into account. 

Mathematically, most of the energy functionals that we shall consider are strongly 
indefinite: they are unbounded from below and all their critical points have an in- 
finite Morse index. Note that the mathematical methods allowing to deal with 
strongly indefinite functionals have their origin in the works of P. Rabinowitz con- 
cerning the study of nonlinear waves [134] and also the study of periodic solutions 
of Hamiltonian systems [135] , Many results have followed these pioneering works, 
and powerful theories have been devised, in particular in the field of periodic orbits 
of Hamiltonian systems: the linking theorem of Benci-Rabinowitz p~7], Clarke- 
Ekeland's dual action functional [35], Conley-Zehnder's relative index [41] , Floer's 
homology [74]... 

Another difficulty with the models presented in this review is the lack of com- 
pactness: the Palais-Smale condition is not satisfied due to the unboundedness of 
the domain K 3 . Variational problems with lack of compactness also have been 
extensively studied. Let us mention the work of Sacks-Uhlenbeck |145j on har- 
monic maps, Lieb's Lemma |114) . Brezis-Nirenberg's study of elliptic PDEs with 
critical exponents [25], the concentration-compactness method of P.-L. Lions |118j . 
Bahri-Coron's critical points at infinity [10] and more recently Fang-Ghoussoub's 
Palais-Smale sequences with Morse information [68] 177]. 

The combination of the two above types of difficulties poses a challenge in the 
Calculus of Variations. To prove the results presented in this review, it has been 
necessary to adapt some of the sophisticated tools mentioned above and to introduce 
new ideas. The novel variational methods that we have designed can be applied 
in general situations and in particular in the study of crystalline matter (nonlinear 
Schrodinger equations with periodic potentials). 

The review contains four different parts which are almost independent. The 
common feature of all the problems addressed in this review is the variational study 
of linear and nonlinear eigenvalue problems with eigenvalues in spectral gaps. In 
the nonlinear case, there are two different classes of problems. Either we fix the 
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eigenvalue and let the L 2 -norm of the solutions free. Or we look for normalized 
solutions, the eigenvalue is then a Lagrange multiplier which has to stay in the 
spectral gap. 

In the first section, we describe the results that have been obtained for some 
models describing one self- interacting free relativistic spin- 1/2 particle. The sim- 
plest case is when the interaction is "local" , i.e. represented by a nonlinear function 
F of the spinor ip(t, x) of the particle. The general form for the equations that we 
consider in this part is: 

D c i/j -wi> = VF(tp). 

These models are phenomenological. A Lorentz-invariant interaction term F(i{i) is 
chosen in order to find a model of the free localized electron (or on another spin 
1/2 particle), which fits with experimental data (see [138j ) . 

At the end of the first section, we present two nonlocal models: the Maxwell- 
Dirac and the Klein-Gordon-Dirac equations in which the electron interacts with its 
own electromagnetic field. The Maxwell-Dirac equations take the following form: 

(D c + v — a ■ A)ijj — ujtfj, 
-4ttAv = |-0| 2 , 

-4irAA k = (V, a k iP) , fc=l,2,3. 

From a mathematical viewpoint, the equations considered in the first section are 
nonlinear eigenvalue problems, in which the eigenvalue is fixed in a spectral gap, 
but the L 2 norm of the solution is not known. They are the Euler-Lagrange equa- 
tions of a strongly indefinite functional. Moreover, this functional does not satisfy 
the Palais-Smale condition and the classical Benci-Rabinowitz linking theorem [17] 
cannot be applied. The solutions are obtained by a "noncompact" linking argument 
inspired by the works of Hofer-Wysocki [93] and Sere |150j on homoclinic orbits of 
Hamiltonian systems. An additional difficulty is that the nonlinearity can vanish 
even for very large values of tp, and this makes the a priori bounds on Palais-Smale 
sequences very delicate. 

The second section is devoted to the study of min-max principles allowing to 
characterize the eigenvalues of Dirac operators with an external potential V in the 
gap of their essential spectrum. Such operators are commonly used to describe the 
dynamics of an electron which is subject to the action of an external electrostatic 
field with associated potential V (for instance an electron in the field created by 
a nucleus). It can also be used to describe many non-interacting electrons. For 
potentials V satisfying appropriate assumptions, the spectrum of the perturbed 
Dirac operator D c + V takes the form 

<j(D c + V) = (—oo, — mc 2 } U {e^jigN U [mc 2 , oo) 

where the e^'s are eigenvalues of finite multiplicity in (-mc 2 , mc 2 ), which can only 
accumulate at the thresholds —mc 2 or mc 2 (see [HI [19]). The min-max formulas 
presented in this section furnish a useful variational characterization of the e^'s and 
of the associated eigenfunctions: 

(D c + V)ifi = Eiifi. 

The min-max formulas are general and can be used in other settings where eigen- 
values in a gap of the essential spectrum have to be characterized or computed. 
Many consequences of the min-max principles are derived in this section, including 
an algorithm for the computation of the eigenvalues. 
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In Section 3, we present results concerning the Dirac-Fock model [156] . allowing 
to describe N interacting electrons in an external electrostatic field. This is a 
nonlinear model which is used in Quantum Chemistry to compute the state of such 
electrons in heavy atoms. The energy functional is strongly indefinite and therefore 
it is really not obvious how to find an adequate definition of the ground state, 
and to prove the existence of critical points in general. Explaining how this can 
be achieved is the main goal of the section. The model consists of a system of N 
coupled nonlinear equations, posed on L 2 (R 3 , C 4 )^: 

D c <S! ifi = enpi, < £j < mc 2 , 

where 

£U =D C + V + T*, 
r$ being an operator depending nonlinearly on $ := (tfi, ^n) and which models 
the interactions between the electrons. The functions tpi'8, which are assumed to 
satisfy the constraints J R3 (ipi, ipj) = 5ij, represent the states of the N electrons. 
Being a system of nonlinear eigenvalue problems with eigenvalues in a spectral 
gap, the Dirac-Fock equations carry some similarity with the equations studied in 
Section 1. But there is a big difference: the L 2 norm of the solutions is now fixed 
a priori, and the eigenvalues e% are unknown Lagrange multipliers associated with 
these constraints. This makes the problem harder, the main difficulty being to keep 
the multipliers Si in the interval (0,mc 2 ). The positivity of £i is obtained thanks 
to a new penalization method. See [26] for a generalization of this method, with 
applications to nonlinear periodic Schrodinger models for crystals. The inequality 
£j < mc 2 follows from Morse-type estimates, as in the existence proof of Lions for 
the nonrelativistic Hartree-Fock model [119j . To obtain these Morse- type estimates, 
the easiest way is to use a general theorem of Fang-Ghoussoub [B5j. Note that, since 
the functional is strongly indefinite, one has to work in fact with a relative Morse 
index. 

Finally, in the last section we present a more involved physical model in which 
the behavior of the electrons is coupled to that of the Dirac sea, in the presence 
of an external electrostatic field V. In this model, Dirac's interpretation of the 
negative energies is really taken into account: the vacuum is considered as being an 
unknown physical object which can react to an external stimulation. The important 
feature of the model will be that the energy functional is bounded from below, as 
first proposed by Chaix and Iracane 36J, showing the importance of the vacuum 
polarization effects. The main drawback will be that one necessarily has to deal 
with infinitely many interacting particles (the real ones and the virtual ones of the 
Dirac sea), which creates lots of mathematical difficulties. In particular, the main 
unknown of the model is, this time, an orthogonal projector P of infinite rank. The 
optimal projector P representing the ground state of the system is solution of a 
nonlinear equation of the form 

(7) P = X{-o.A D c + V + T' P ) 

where T' p is an operator depending on the projector P and describing the inter- 
actions between all particles (the real and the virtual ones). We have used the 
standard notation xi(A) for the spectral projector of A associated with the inter- 
val /. Solutions of (0 are obtained by a minimization principle, on a set of compact 
operators. One has to be very careful in the choice of this set, and in the definition 
of the energy. A serious difficulty is the presence of ultraviolet divergencies. 



VARIATIONAL METHODS IN RELATIVISTIC QUANTUM MECHANICS 



7 



Acknowledgment. First of all, M.J.E. and E.S. would like to thank T. Cazenave, 
I. Ekeland, P.-L. Lions and L. Vazquez for having introduced them to this field. We 
would like also to thank J. Dolbeault, V. Georgiev, C. Hainzl, M. Loss and J. -P. 
Solovej with whom we have collaborated on differents parts of the works described 
in this review. Some years ago P. Chaix was very patient in explaining to us some 
models of QED on which we have worked later; the last section of this paper has its 
origin in those conversations. Several discussions with B. Buffoni were very inspiring 
to us when looking for new variational principles in the case of strongly indefinity 
functionals. We want also to acknowledge our participation in the european research 
projects "Analysis and Quantum" (HPRN-CT-2002-00277) and "HYKE" (HPRN- 
CT-2002-00282). The interaction with our partners in those two projects has been 
a source of inspiration and motivation for us along the years and we thank them for 
that. We also acknowledge support from the ANR "ACCQUAREL" of the french 
ministry of research. E.S. was partially supported by the "Institut Universitaire 
de France". Finally, we wish to thank P. Rabinowitz, who has been very patient 
encouraging us to write this review article. 

Notations and basic properties of the Dirac operator. 

Before going further, let us fix some notations. All throughout this review, the 
conjugate of z 6 C will be denoted by z* . For X — [z\, ...,Z4) T a column vector 
in C 4 , we denote by X* the row co-vector (z*, ...,z|). Similarly, if A = (fly) is a 
4x4 complex matrix, we denote by A* its adjoint, (A*)ij — a* :i . 

We denote by (X, X') the Hermitian product of two vectors X, X' in C 4 , and 
by |X|, the canonical hermitian norm of X in C 4 , i. e. \X\ 2 = J2l=i X*X, . The 
usual Hermitian product in L 2 (M 3 ,C 4 ) is denoted as 



(8) = f U(x)^\x)) 

./1B3 V / 



d 3 x. 

For the sake of simplicity, we shall use a system of units in which 

m = h = 1. 

Actually, by scaling one can also fix the value of another physical constant, like 
for instance the speed of light c or the charge of an electron e. We shall use both 
possibilities in this review (they are of course equivalent). 

Let us now list some basic and important properties of the free Dirac operator. 
We refer to the book of Thaller [159] for details. 

Proposition 1 (Basic properties of the free Dirac operator). The free Dirac opera- 
tor D c is a self-adjoint operator on L 2 (K 3 ,C 4 ), with domain 77 1 (]R 3 ,C 4 ) and form- 
domain i? 1 ^ 2 (R 3 , C 4 ). Its spectrum is purely continuous, cr(D c ) = (— oo, — c 2 ] U 
[c 2 ,+oo). Moreover, there are two orthogonal projectors (both having infinite rank) 
from L 2 (]R 3 , C 4 ) into itself, P? c and P° c = 1 2 - P? c , such that 



(9) 



D c Pl, c = P+,c D c = V C-g'A P £ |C - P^ c Vc 4 - c 2 A 
D C P°\ = P°' C D C = -Vc 4 -c 2 A P° c = -P° Vc 4 - c 2 A. 



The projectors P° c and P® c are multiplication operators in the Fourier domain, 
given by 



(10) 



p0 _ ±Dc(p) + |p| 2 + c 4 



2^/c 2 H 2 + c 4 



8 



MARIA J. ESTEBAN, MATHIEU LEWIN, AND ERIC SERE 



Note that Proposition Q] enables us to split the space 

Sj := L 2 (R 3 ,C 4 ) 

as the direct sum of two infinite dimensional Hilbert spaces Sj j_ c = P£ c Sj. The 
restriction of D c to Sj± c is a self-adjoint operator in this subspace, with domain 
iOj. c H H 1 (M 3 , C 4 ). Furthermore, it will be convenient to use the following norm in 
$), equivalent to the usual norm of i/ 1 / 2 (]R 3 , C 4 ), 

(11) Hh : = UaDcPI.c-DcP ^) 1 ' 2 = U,\d c \^) 1/2 

The subspaces Sj° +>c n H 1 / 2 (R 3 , C 4 ) and i}° jC n tf 1 / 2 ^ 3 , C 4 ) arc orthogonal for this 
norm as well as for the L 2 norm. 

When c = 1, one recovers the usual H 1 / 2 norm. In this case, we shall use the 
convenient notation P± 1 = P± and Sj± 1 = 

1. Nonlinear Dirac equations for a free particle 

In this section, we present some nonlinear Dirac equations for a free particle. We 
therefore do not consider any external potential (but possibly a self-generated one) . 
Stationary solutions of such equations represent the state of a localized particle 
which can propagate without changing its shape. The first to propose and study 
models for the description of this phenomenon were Ivanenko [97], Weyl [169] and 
Heisenberg [90]. We refer to Ranada [138] for a very interesting review on the 
historical background of this kind of models. 

In this section, we shall always assume that c = 1. A general form for the 
equations that we want to present is 

(12) idtV - £>i* + VF(*) = , 

where *(£, ■) S i 2 (IR 3 , C 4 ). Throughout this chapter we assume that F e C 2 (C 4 , K) 
satisfies 

(13) F(e i0 ty) = for all 6 G K . 

The charge of the particle — e does not appear since it is incorporated into the 
nonlinear functional F. 

The relativistic invariance requirement imposes severe restrictions on the possible 
nonlinearities F. In two very interesting papers [69, 70 , Finkelstein et al proposed 
various models for extended particles corresponding to various fourth order self- 
couplings F. In those papers, they gave some numerical description of the structure 
of the set of solutions, for different values of the parameters. Among the considered 
functions F, in [69l [70] we find the family of general self-couplings 

(14) F 6 (tf) := A{(**) 2 + 6(*7 5 *) 2 } 

where 7 5 = — ia.\a.20LZi b is a real constant, and where we have used the notation 

:= 08*,*). 

In the sequel, without any loss of generality, we will assume that A = 1/2. 
Stationary solutions of (fT2"]) are functions of the type 

(15) *(t,a) = e- iut i>(x), 
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such that tp is a non-zero localized solution of the following stationary nonlinear 
Dirac equation 

(16) Dxip - Dip - VF(ip) = in M 3 . 

It is interesting to note that the latter equation has a variational structure: it is 
indeed the Euler-Lagrange equation associated with the functional 

(17) /"(V) = fJ^iDx^x^x^-^xt-F^x^dx. 

Hence, the solutions of (fll)]) are formally the critical points of the "energy" func- 
tional I u . In this context, we say that i/j is a localized solution if /"(VO is well- 
defined, that is, if i/j S H^ 2 (R 3 ,C 4 ) and F(ip) e L 1 (R 3 ,R) . Due to the structure 
of the Dirac operator Di, the functional J" is not bounded below and solutions of 
(|16p cannot be obtained by a minimization method. 

1.1. Soler models: existence by O.D.E. techniques. The case b — in the 
definition (fl4|) of Fj, was proposed by Soler in [154 to describe elementary fcrmions. 
In this case, (JTTJ) reduces to 

(18) Dxip - (jtp - (V^)^V = in R 3 
which is usually called the Soler model. Its more general version 

(19) Diij) - uil> - g$ip)P<tl) = in M 3 

is often called the generalized Soler equation, and it is the Euler-Lagrange equation 
associated with I u for F(ip) = §G(|v) , G' = 5, G(0) = 0. 

The main advantage of (|19p is the existence of a special ansatz compatible with 
the equation: 

. , , / cos 
\ v ' ^sm e K J J 

In this ansatz, equation ([19]) reduces to the O.D.E. system 

+ = yUv* - u 2 ) - {1- u)) 

v' = u[g(v 2 - u 2 ) - (1 + lo)) 

The O.D.E. system (f2T|) has been extensively studied. In |164j Vazquez proved 
some qualitative properties of the solutions in the case < lo < 1, and showed the 
non-existence of localized solutions when \lo\ > 1. Cazenave and Vazquez obtained 
the first rigorous existence result for this problem in [33j . More precisely, in |33j 
they proved the existence of a solution (u, v) to (f2T1) for nonlinearities g of class 
C 1 satisfying: 

(22) g: [0, +co) ^ [0, +00) , g(0) = , g > , lim g(s) = +00 , 

whenever < u; < 1. Moreover, this solution (u,v) is such that u and u are 
positive on R + , u(0) = 0, v(0) > 0. Additionally, u and u (as well as their first 
derivatives) decay exponentially at infinity. 

Later on, Merle [124] improved the above result by extending it to a more gen- 
eral class of nonlinearities g. Then, Balabane et ai proved the following general 
multiplicity result: 



(20) rl>(z) 



(21) 
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Theorem 2. ([llj) Assume that g : K — ► M is a function of class C 1 such that 
g(0) = 0, g is increasing in (0,+oo) ; g(s) > 1+uj for s large, g'(g~ l {\ — uj)) > 
and g(s) < for s < 0. Then, for any uj € (0, 1) , there exists an increasing 
sequence of positive numbers {x n } n >i such that for every n > 1, there is a solution 
(u n ,v n ) of H21\) satisfying 

• u n (0) = 0,v n (0) = x n , 

• u n and v n have n — 1 zeros in R + , 

• u n and v n decay exponentially at infinity, as well as their first derivatives. 
Moreover, if for all s, g(s) = s, then the sequence {x n } is bounded. 

In the case of singular nonlinearities, compactly supported solutions may exist. 
More concretely, the following result was proved in [12] ; 

Theorem 3. ( [12J ) Assume that g : (0, +oo) — > (— oo,0) is a function of class 
C 1 which is nondecreasing and integrable near the origin. Suppose also that there 
exists a number a such that g{a 2 ) = 1 — uj , while g'(a 2 ) > 0. Then, for every 
< uj < 1 there exists a positive solution (u,v) of i21\) . Moreover, the support of 
(u, v) is bounded if and only if 

ds f s 
/ —. < +oo , where G(s) := g(x) dx . 

Jo G(s) J Q 



1.2. Soler models: existence by variational techniques. All the above results 
were obtained by a dynamical systems approach. But it is also possible to exploit 
the variational structure of (HHJ) (and also of the O.D.E. system ([2~T]) ) to prove 
existence of solutions. 

In the case of (|2Tj) . the use of variational methods does not radically improve 
the results that were obtained by O.D.E. methods (see above). The assumptions 
needed to use variational techniques are slightly different. In [62], Esteban and Sere 
obtained the following result: 

Theorem 4. ([62]) Let F : C 4 -> K satisfy 

FW = \g$N>), GeC 2 (R,R), G(0) = 0, 

with G G C 2 (M,K). Denoting by g the first derivative of G, we make the following 
assumptions: 

(23) Viel, xg(x) > 9G(x) for some > 1, 

(24) G(0) = 5 (0) = 0, 

(25) (Vx e M, G(x) > ) and G(A ) > for some A > 0, 

(26) < uj < 1 . 

Then there exist infinitely many solutions of Equation \lb}) in 
P| 2<(?<+00 M^ 1:9 (R 3 , C 4 ). Each of them solves a min-max variational problem on 
the functional I u . They are of the form h20\) and thus correspond to classical so- 
lutions of on R + . Finally, they all decrease exponentially at infinity, together 
with their first derivatives. 
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The interest of using variational techniques appears much more clearly when one 
studies equations for which no simplifying ansatz is known, for instance in the case 
where F = Ff, with b ^ 0. In that case, Equation (|16p cannot be reduced to a 
system of O.D.Es similar to pTjl . A general result proved by Esteban and Sere in 
this context is the following: 

Theorem 5. ([62 ) Let be F(t/>) = \(\ipip\ ai + b\^ 5 ip\^) , with 1 < a\, a 2 < §; 
A, b > 0. Then, for every lj € (0,1), there exists a non-zero solution of 116]) in 

fW+^n^c 4 ). ' 

In fact, Theorem [5] is a consequence of the more general following result in [52"] 

Theorem 6. ([62 ) Assume that F : C 2 -> R satisfies : 

(27) W>eC 4 , < F(ijj) < ai(|0| Ql + M" 2 ) , 

with a\ > and 2 < a\ < a 2 < 3. Assume moreover that 

/ FeC 2 (C 4 ,l), F'(0) = F"(0) = , 
1 J 1 < a 2 |0| tt2 - 2 , a 2 >0, /or^l^e; 

(29) VVeC 4 , VF(ij;)-^ > a 3 F(V), a 3 > 2 ; 

(30) 3 a 4 > 3, V <5 > 0, 3 C s > 0, V V G C 4 , \VF(ip)\ < (<5 + CfcFtyO 7 *) |# 

(31) V^eC 4 , f(^) > a 5 |#r-a 6 , v > 1, a 5 ,a 6 >0. 

Then, for every u G (0, 1), i/iere errasfc a non-zero solution of U6\) in 

fW+oo^O^C 4 ). 

1.3. Existence of solutions by perturbation theory. Another way of finding 
solutions to nonlinear Dirac equations is perturbation theory. In this approach, 
one uses previously known information about the nonlinear Schrodinger equation 
[168] . which is approached in the non-relativistic limit. Ounaies proved in [130j that 
solutions of some nonlinear Dirac equations, when properly rescaled, are close to 
solutions of the nonlinear Schrodinger equation, with the same nonlinearity, when 
the phase lu approaches 1. More precisely, assume for instance that 

Fty) := ~(G(^0+#W¥V)) , 

where G,H are two functions of class C 2 such that G(0) = H(Q) — 0, and such 
that g := G' and h := H' are homogeneous of degree 8 £ (0, 1]. Then, if we write 
any 4-spinor ip as ip = the main theorem in |130] states the following 

Theorem 7. ( |130| ) Under the above assumptions, let 1 — oj = a 28 = X 2 = = 
e. If we rescale the functions ip, x as follows 

tp{x) := aip(Xx), x(x) ■= &x(Ax) , 

then, ip — (^) is a solution to i f 16]) if and only if (p, x are solutions to the system 

f (-ia- V)x + V-9{\v\ 2 )v + Ki(e,(p,x) = 
\ {-ia-V)(p-2x + K 2 {E,(p,x) = 0. 
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Here K\ and K2 are small functions for small e, Cp and \ taking values in a bounded 
set 0/C 2 . Moreover, for e small enough, there exist solutions to the above eguation. 
They are close to a solution of the nonlinear Schrddinger eguation 

(32) _Ia^ + ^_ 5 (|^|2)^ = 0i X = -!((,. V)$>. 

Remark 8. Note that the function h = H' does not appear in the above limiting 
eguation. 

The proof of this theorem makes use of the implicit function theorem in an 
appropriate manner. Important ingredients are the uniqueness (up to translation) 
of the solution to the elliptic equation ([52"]) for (p and its nondegeneracy [1061 HOI 

ung. 

1.4. Nonlinear Dirac equations in the Schwarzschild metric. All the above 
models correspond to the Dirac equation written in the Minkowski metric, this 
is, in flat space. But the space-time geometry plays an important role when one 
wants to take relativistic effects into account. For instance, when considering the 
Schwarzschild metric outside a massive star, the nonlinear Dirac equation appears 
to be different. 

In [9] A. Bachelot-Motet has studied numerically this problem in the case of the 
symmetric solutions as above. One has to study a system of O.D.Es similar to (j2Tj) 
but with r-dependent coefficients instead of constant ones. More precisely, in the 
ansatz (120)) and for the case F(s) — \\s\ 2 , system (|2"Tj) becomes 

^ fW + 2(/ + /V2) = V [\( V 2 - u 2 ) - (/V2 - W )] 

{ j fv' + Hf~f 1/2 ) = u[\{v>-u*)-(fV* + u)], 

where f(r) = 1 — p. 

Notice that this problem is not to be considered in the whole space: since the 
physical situation corresponds to the outside of a massive star, the natural domain 
is the complement of a ball, or in radial coordinates, the interval r > ro , for some 
ro > . In this case, the usual "MIT-bag" boundary condition reads 

u(r ) = -v(r ) . 

The very interesting numerical results obtained by Bachelot-Motet suggested 
conditions for some existence and multiplicity results for (|33|) that were later rig- 
orously proved by Paturel in [131] . Note that in (13 lj the solutions are found as 
critical points of a reduced energy functional by a mountain-pass argument, while 
as we see below, we use a linking method to produce our solutions. 

1.5. Solutions of the Maxwell-Dirac equations. The nonlinear terms appear- 
ing in all the above models are local, that is, are functions of the spinor field 
But in some cases, one has to introduce nonlocal terms, like for instance when con- 
sidering the interaction of the Dirac field with a self-generated field. In this case, 
the equations become integro-differential. 

Our first example is the Maxwell-Dirac system of classical field equations, de- 
scribing the interaction of a particle with its self-generated electromagnetic field. 
In order to write the equations in relativistically covariant form, we introduce the 
usual four-dimensional notations: let be 7 := (3 and 7 fc := /3a&. For any wave- 
function ^(xq,x) : I x I 3 h C 4 (note that xq plays the role of the time t), wc 
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denote ^> := (3^ . In the Lorentz gauge the Maxwell-Dirac equations can be written 
as follows 

(ij^d^-^A^ - 1)* = in RxM 3 

8^ = 0, 4^8^ A v = (t,7 w *) in IxR 3 . 

Notice that we have used Einstein's convention for the summation over /z. We also 
introduce the electromagnetic current J v := 7"^). 

Finite energy stationary solutions of classical nonlinear wave equations have been 
sometimes used to describe extended particles. Of course the electromagnetic field 
should in principle be quantized like in Quantum Electrodynamics. In the Maxwell- 
Dirac model, the field is not quantized but it is believed that interesting qualitative 
results can be obtained by using classical fields (see, e.g. [79] Chapter 7]). 

Another example of a self-interaction is the Klein-Gordon-Dirac system which 
arises in the so-called Yukawa model (see, e.g. [20]). It can be written as 

(«7 /i fl| ti - x ~ 1)* = in R x R 3 

d^d^x + M 2 X = h *) in Ixl 3 . 

Other related models, that we will not discuss, include the Einstein-Dirac-Maxwell 
equations, which have been investigated by F. Finster, J. Smoller and S.-T. Yau 
[71] . [72] . The above systems (134]) and ([35]) have been extensively studied and 
many results are available concerning the Cauchy problem (we refer to [51] and 
[75] . chapter 7, for detailed references). 

A stationary solution of the Maxwell-Dirac system ([34]) is a particular solution 
A) : M x K 4 h> C 4 x K 4 of the form 

f *(x ,x) = e - iuJXo ip(x) with V:K 3 ^C 4 , 

(36) 1 ^(x ,x) = ./"*^ = fwAny)- 

The existence of such stationary solutions of ([34]) has been an open problem for a 
long time (see, e.g. (7^1 p. 235]). Indeed, the interaction between the spinor and 
its own electromagnetic field makes equations ([M]) nonlinear. 

Concerning stationary solutions of (|34p . let us mention the pioneering works of 
Finkelstein et al [BUI and Wakano [166] . The latter considered this system in the 
approximation Aq ^ 0, A\ = A2 = A3 = 0, the so-called Dirac-Poisson system. 
This problem can be reduced to a system of three coupled differential equations 
by using the spherical spinors ([20]) . Wakano obtained numerical evidence for the 
existence of stationary solutions of the Dirac-Poisson equation. Further work in this 
direction (sec [138 ) yielded the same kind of numerical results for some modified 
Maxwell-Dirac equations which include some nonlinear self-coupling. 

In [120] Lisi found numerical solutions of the Dirac-Poisson and of the Maxwell- 
Dirac systems. The computation of the magnetic part of the field A for these 
solutions showed that Wakano's approximation was reasonable, since the field com- 
ponents (Ai,A 2 , A 3 ) stay small compared with A . See also [1361 11371 12"4"] where 
various kinds of stationary solutions are considered, like the so-called static solu- 
tions which have no current "flow" . 

In the case — 1 < lu < 0, Esteban, Georgiev and Sere [61] used variational 
techniques to prove the existence of stationary solutions of ([34]) . Any solution of 
([34]) taking the form (|36| corresponds (formally) to a critical point ip of the following 
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functional: 

W) = I \{ia k d k M)-\^^)-^\ 2 --JI J ^ )J j y) dxdy. 
Jr3 2 2 2 4 JJ R 3 XR 3 \x-y\ 

This remark was used in 61j to find a stationary solution of (|34p in the appropriate 
space of functions. 

Theorem 9. ([61)) For any lo strictly between —1 and 0, there exists a non-zero 
critical point ip u of I u . This function tp u is smooth in x and exponentially decreasing 
at infinity. Finally, the fields $(io,i) = e~ %u}x ° ifuj, A^(xq,x) — J% * are 
solutions of the Maxwell-Dirac system {3$ . 

Later, using cylindrical coordinates, S. Abenda [1] extended the above result to 
the case — 1 < u> < 1. Indeed, in the class of cylindrically symmetric functions, 
the energy functional has better properties which allow to use the same variational 
procedure as in the work of Esteban-Sere, but in the more general case w € (—1, 1). 

Many questions are still open about the existence of stationary solutions for (f3~4"| . 
It is easy to see that they have all a negative "mass" . Wakano already observed this 
phenomenon for the soliton-like solutions of the Dirac-Poisson system. However, it 
was shown in 166J that a positive mass can be reached by taking into account the 
vacuum polarization effect. 

For the case of the Klein-Gordon-Dirac equations the situation is slightly simpler 
because they are compatible with the ansatz (I20|) introduced above. So, in this case 
the authors of 61j did not only obtain existence of solutions, but also multiplicity: 

Theorem 10. (|61j) For any lo strictly between —1 and 0, there exists infinitely 
many solutions to the Klein-Gordon-Dirac system \35\) . These solutions are all 
smooth and exponentially decreasing at infinity in x. 

We finish this section by explaining the general ideas of the proof of Theorem [9] 
The proofs of Theorems HI [5] and [6] basically follow the same lines and we will skip 
them. 

Sketch of the proof of Theorem [9j As already mentioned in the introduction, 
the presence of the negative spectrum for the Dirac operator forbids the use of a 
minimization argument to construct critical points. Instead, the solution will be 
obtained by means of a min-max variational method based on complicated topolog- 
ical arguments. This kind of method used to treat problems with infinite negative 
and positive spectrum have been already used under the name of linking. The link- 
ing method was introduced by V. Bcnci and P. Rabinowitz in a compact context 
[17) . The reasons making the use of variational arguments nonstandard in our case 
are : (1) the equations are translation invariant, which creates a lack of compact- 
ness; (2) the interaction term J^A^ is not positive definite. Note that as we have 
alredy pointed out, in some cases one can perform a reduction procedure and ob- 
tain a reduced functional for which critical points can be found by a mountain-pass 
argument |131j . 
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First step: Estimates. Defining = J M * then one deduces J^A^ = 
J°A° - ^1=1 J k A k and 

L(^):= ff dxdy= f j^A^dx. 

JJr 3 xm 3 \ x —y\ Jr 3 

Let us also introduce the functional 
(2.2) Q W =[[ 



\x-y\ 

It is easy to prove that Q is non-negative, continuous and convex on f) = i? 1 / 2 (R 3 , C 4 ), 
and vanishes only when (ip,ip)(x) = a.e. in M. 3 . 

Let us state a lemma giving some properties of the quadratic forms in I u : 

Lemma 11. For any tp S $}, the following inequalities hold: 



(i) J^A^x) > , a.e. in K 3 , 

(ii) J R3 J»A^ > Qf», 

1/2 

(ni) A a > lELil^l ' 
(iv) h^A^l < CVA® y/WT^ a.e. in K 3 . 

Remark 12. Note that when the junction ip is cylindrically symmetric, the func- 
tional L defined above is not only non-negative, but actually controls from below 
IMIfl ( see Lemma 1 in This is the reason why Abenda has been able to treat 

the case lo £ (—1,1), extending Theorem^ 

Another important information is given by the 

Lemma 13. Let be fi> 0. There is a non-zero function e+ : (0; oo) — > S)^ = A^-f) 
such that, if A + ijj = e + (/i), then 



2Jm 3 ^ ~ 2 

Second step: Modified functional and variational argument. In order to 
obtain some coercivity, a modified functional I u<e was considered in [61] . It reads 

where e > 0. The critical points of / Wie (-0) satisfy 

i^dktp — ip — ui-f°i/j — j^A^ij} — ej°\ip\^j = 
-4ttAA q = J° = |V>| 2 , -ATrAA k = - J k . 

Let 9 be a smooth function satisfying 8(s) — for s < — 1 and 9(s) — 1 for 
s > 0. The gradient being defined by — V/ Wl£ = — |I?i| _1 /^ £ , let us consider the 
flow for positive times t, rf w e , of a modified gradient : 



(37) 



(38) 



V° = H 

dtvls = ~ Wu,e)VI», e ) o ri <E . 
It can be seen that for e > the functional L u ^ e enjoys the properties needed 
for the Benci-Rabinowitz linking argument [17] . except that its gradient is not of 
the form L + K with L linear and K compact. Due to this lack of compactness, 
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one cannot use Leray-Schauder's degree. One can work instead with a generalized 
version of the Leray-Schauder Z2-degree, due to Smale |153j to show the existence 
of a positive critical level of I UjS . This idea was introduced by Hofer-Wysocki 
[93j in the study of homoclinic orbits of nonconvex Hamiltonian systems, where a 
similar lack of compactness occurs. Hofer and Wysocki worked with the unregu- 
larized L 2 -gradient. This gradient does not have a well-defined flow, but for the 
linking argument it is sufficient to consider certain smooth gradient lines, which 
are pseudo-holomorphic curves satisfying boundary conditions. Later Sere |150| . 
studying homoclinic orbits on singular energy hypersurfaces, worked with the H 1 ! 2 - 
regularized gradient, which has a well-defined flow leading to an easier and more 
flexible linking argument. This approach is adapted to nonlinear Dirac in [62j and 
to Dirac-Maxwell and Dirac-Klein-Gordon in [BT]. Consider the sets 

AT = {V = i>- + Ae+( M ) , V- e sr. H-h < IMm)IU, a e [o, 1]} 

and 

Z + = {iP € S)f / wu = p} , P >Q. 
Then one can prove the 

Proposition 14. For any —\<uj< —p, e > and £_|_, A/"- constructed as above, 
there exists a positive constant c u , such that the set rfa £ (JV— ) H £+ is non-empty, 
for all t > 0. Moreover, the number 

Cu,..e = mf I u , e O Tfl ie (Af-) 

is strictly positive, it is a critical level for I u ^ e and c^.e — ► c u > as e — > 0. 

Additionally, for any u>,e fixed, there is a sequence {<y9™ £ }n>o such that as n — > 
+00, 

[ j I (i + lbS je ||)v/ Wie « e ) - o. 

Remark 15. In |61j and [62] - an easy regularization step is missing. Indeed, 
Smale's degree theory requires C 2 -regularity for the flow, which corresponds to C 3 - 
regularity for the functional. In the case of the local nonlinear Dirac equation, 
such a regularity can be easily achieved by a small perturbation of the function 
F(ip) + s\ip\ a2 tp. Since all the estimates will be independent of the regularization 
parameter, the solutions of the non-regularized problem will be obtained by a limiting 
argument. 

Note that the linking argument of [150] , [62] and [61] has inspired later work (see 
[162[ [157 ), where an abstract linking theorem in a noncompact setting is given, 
valid first for C 2 and then for C 1 -functionals. 

Third step: Properties of the critical sequences. The concentration-compactness 
theory of P.-L. Lions (1 18] allows us to analyze the behavior of critical sequences of 
I u , e as follows: 

Proposition 16. Let u € (—1,0) and e > be fixed. Let (ip n ) C ,f) be a sequence 
in S) such that 

(40) < inf \\ipn\U < supl^nlli, < +oo 
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and I' u s {ij)n) ~ * in Sj' as n goes to +00. Then we can find a finite integer p > 1 , 
p non-zero solutions ip 1 , . . . ,ip p of \3 r /\ ) in Sj and p sequences (x l n ) cl 3 ,i = l,...,p 
such that for i ^ j, \x l n — x 3 n \ — * +00 , and, up to extraction of a subsequence, 



-> 

ft n — >+oc 



Obtaining estimates in H 1 ^ 2 (M 3 ) for the sequence {<p" E } n >o of Proposition [Til 
is quite easy because of the coercivity introduced by the perturbation term in 
e. Moreover, c W)E being strictly positive, the sequence {<^™ e }n>o is also bounded 
from below away from 0. So, Proposition [16] applies to prove the existence of a 
solution to (|37|) for every e > 0. Next, we want to pass to the limit when e goes 
to 0. Note that we are doing so along a sequence of functions which are exact 
solutions of the approximate problem ([57)) . This part of the proof is done by first 
proving the equivalent of the Pohozaev identity for equation (|37p . e > 0, and then 
by introducing some special topologies in the spaces L q which are related to the 
decomposition of R 3 as the union of unit cubes. Analyzing the solutions to (|37|) 
in those topologies, we find the following 

Theorem 17. There is a constant k > such that if — 1 < uj < and < e < 1, 

there is a function ip s € such that I' u e (ip £ ) — and 



Last step: Passing to the limit e — * 0. Eventually, we use Proposition [JJ)] 
to pass to the limit e — > 0. When obtaining the estimates (f^Uj) for the critical 
sequences of I U)E , we observe that the lower estimate for the norm 1 1 • 1 1^ is actually 
independent of e. Assume, by contradiction that the upper estimates do not hold 
for the sequence (ip £ )- Then, we consider the normalized functions 

and apply PropositionfTBlto the sequence (ipe)- Under the assumption that H^ellfl — * 
+00, we use all the previous estimates to infer that for j = 1, . . . ,p, 

p , 

V/ +u\v\ 2 dx = 0, Q(ip j )=0. 

But the latter implies that for every j, Cp ^ = a.e. and so, from the r.h.s. 
identity we obtain that (p- 7 = a.e. for all j. This contradicts Theorem [T7l □ 

1.6. Nonlinear Dirac evolution problems. The results which we have men- 
tioned so far are concerned with the existence of stationary solutions of various 
nonlinear Dirac evolution equations. These particular solutions are global and do 
not change their shape with time. The study of the nonlinear Dirac evolution 
problem 

id t V - D x ¥ + = 

*(0) = *o 

is also interesting in itself and, even if this is not the aim of the present paper, let 
us mention some references. 



(41) 
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For the case of local nonlinearities as the ones considered in this section, sev- 
eral works have proved well-posedncss for small initial data in well chosen Sobolev 
spaces. For nonlinearities containing only powers of \I/ of order p > 4, Reed proved 
in |139j the global well-posedness for small initial data in H s , s > 3. A decay 
estimate at infinity was also obtained in this paper. Later, Dias and Figueira [43] 
improved this result to include powers of order p = 3 and for s > 2. Najman [127] 
took the necessary regularity of the initial data down to H 2 . In [60] Escobedo 
and Vega proved an "optimal result" which states that for the physically relevant 
nonlinearities of order p > 3 of the type 

(42) G(*) := A|(**) £ ^ i /3* + 6(^,7 5 *) £ ^ i 75*|, 

there is local well-posedness of the evolution equation in H s , for s > | — jzj, when 
p is an odd integer, while s has to be in the interval (§ — ) otherwise. 

Moreover, if p > 3, then the problem is globally well-posed for small initial data 
in H s ( p \ with s(p) = | — — ^ . For a more recent result, see for instance a paper 
of Machihara, Nakanishi and Ozawa |121j . in which the existence of small global 
solutions is proved in H s for s > 1, and the nonrelativistic limit is also considered. 

An interesting question to ask is about the (linear or nonlinear) stability proper- 
ties of the stationary solutions with respect to the flow generated by the evolution 
equation. At present this seems to be a widely open problem (see [138] and [155] 
for a discussion). Recently, Boussaid [2Tj has obtained the first stability results, for 
small stationary solutions of nonlinear Dirac equations with exterior potential. 

Concerning the Cauchy problem for the Maxwell-Dirac equations, the first result 
about the local existence and uniqueness of solutions was obtained by L. Gross in 
[82] . Later developments were made by Chadam 34J and Chadam and Glassey 
[35] in 1 + 1 and 2 + 1 space-time dimensions and in 3 + 1 dimensions when the 
magnetic field is 0. Choquet-Bruhat studied in [38] the case of spinor fields of 
zero mass and Maxwell-Dirac equations in the Minskowski space were studied by 
Flato, Simon and Tallin in [73 . In [75], Georgiev obtained a class of initial val- 
ues for which the Maxwell-Dirac equations have a global solution. This was per- 
formed by using a technique introduced by Klainerman (see [1001 I101[ 1102] ) to 
obtain L°° a priori estimates via the Lorentz invariance of the equations and a 
generalized version of the energy inequalities. The same method was used by Bach- 
elot [5] to obtain a similar result for the Klein-Gordon-Dirac equation. Finally, 
more recent efforts have been directed to proving existence of solutions for the 
time-dependent Klein-Gordon-Dirac and Maxwell-Dirac equations in the energy 
space, namely C{—T,T- 1 H 1 I' 1 x H 1 ). The existence and uniqueness of solutions 
to the Maxwell-Dirac system in the energy space has been proved by Masmoudi 
and Nakanishi in [122] 1123] . improving Bournaveas' result in [22], where the space 
considered was C(-T, T; ffV 2 + e x H 1+£ ). 

Note that as mentioned above, the stationary states of the form ([36]) are particu- 
lar solutions of the Maxwell-Dirac equations. Physically they correspond to bound 
states of the electron. 

2. Linear Dirac equations for an electron in an external field 

When looking for stationary states describing the dynamics of an electron moving 
in an external field generated by an electrostatic potential V, one is led to study the 
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eigenvalues and eigenfunctions of the operator D c + V . If the electron has to enjoy 
some stability, the eigenvalues should also be away from the essential spectrum. 
In the case of not very strong potentials V, the essential spectrum of D c + V is 
the same as that of D c , that is, the set (—00, — c 2 ] U [c 2 , +00). So the eigenvalues 
which are of interest to us are those lying in the gap of the essential spectrum, i.e. 
in the interval (— c 2 , c 2 ). More precisely, in general a state describing an electron is 
always assumed to correspond to a positive eigenvalue. It is therefore important to 
be able to determine whether there are positive eigenvalues or not, and what is the 
behaviour of the 'first' eigenvalue when V varies (whether it crosses or dives into 
the lower negative essential spectrum for instance). Note that one expects that for 
a reasonable potential there are no eigenvalues embedded in the essential spectrum. 
Very general conditions on V which ensure nonexistence of embedded eigenvalues 
have been given by [HI Qj)] . Note finally that in this section c is kept variable. 

Formally, the eigenvalues of the operator D c + V are critical values of the 
Rayleigh quotient 

(43) <M« := "^y*' 

in the domain of D c + V . Of course, one cannot use a minimizing argument to find 
such critical points since, due to the negative continuous spectrum of the free Dirac 
operator, Qy is not bounded-below. Many works have been devoted to finding 
non-minimization variational problems yielding the eigenvalues of D c + V in the 
interval (— c 2 , c 2 ). Another important issue is to avoid the appearance of spurious 
states (some eigenvalues of the finite dimensional problem may not approach the 
eigenvalues of the Dirac operator D c + V ) as it has been the case in many proposed 
algorithms (see for instance 55J). W. Kutzelnigg has written two excellent reviews 
[1041 1105] on this subject, where many relevant references can be found. The main 
techniques which have been developed so far and used in practice can be divided 
in three groups: 

(1) Use of effective Hamiltonians whose point spectra are close to the spectrum 
of the Dirac operator in the limit c — > +00. For instance, one can cut at a 
finite level some infinite asymptotic formal expansion in negative powers of 
c. To this category of works belong for instance [55 1 l59 j fTTO l fTTT | \TT2 \ [105] . 

(2) Use of a Galerkin approximation technique to approach the eigenvalues, 
and this without falling into the negative continuum (— 00,— c 2 ). This is 
equivalent to projecting the equation onto a well-chosen finite dimensional 
space. This procedure has been well explained for instance in [56l [57l [T04] . 

(3) Replacement of the problematic minimization of Qy by another one. For 
instance, it was proposed to minimize the Rayleigh quotient for the squared 
Hamiltonian (D c + V) 2 (see, e.g. [1671 ITS] ) or later on, to maximize the 
Rayleigh quotients for the "inverse Hamiltonian" \p c ^y\2 (see [92] ). 

Before we go further, let us recall some useful inequalities which are usually used 
to control the external field V and show that D c + V is essentially self-adjoint. We 
recall that f) = _ff 1 / 2 (M 3 , C 4 ) and that are the positive and negative spectral 
subspaces of D\. 
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Proposition 18 (Hardy-like inequalities). The Coulomb potential W(x) = 
satisfies the following Hardy-type inequalities: 

(44) W < < \D C \, 



2 ~ 2c 











5- 


■i 









(46) W 2 < -4A < 4|D!| 2 . 

The inequalities of Proposition[TH]are classical (see, e.g. [HU [HE] for (0H)), except 
for ([45| . The proof of the latter is based on a method of Evans, Perry and Siedentop 
[67] and is contained in the recent papers 29, 160, 161 . 

2.1. A variational characterization of the eigenvalues of D c + V . Formally, 
the eigenvalues of D c + V lying in the gap of the essential spectrum should be 
described by some kind of min-max argument. This was mentioned in several 
papers dealing with numerical computations of Dirac eigenvalues before it was 
formally addressed in different contexts in a series of papers [53] [STJ [HOI HOI 02] ■ 

For the sake of clarity, we are going to present only a particular version of those 
min-max arguments allowing to characterize eigenvalues of the operator D c + V 
for appropriate potentials V. This method derives from a proposition made by 
Talman [158j and Datta, Deviah [42] and based on the decomposition of any spinor 
%j) = (fj as the sum of its upper and its lower components: 

This proposal consisted in saying that the first eigenvalue of D c + V could be 
obtained by solving the min-max problem 

(48) mm max — — . 

The first rigorous result on this min-max principle was obtained by Griesemer 
and Siedentop [ST] , who proved that (|48|) yields indeed the first positive eigenvalue 
of D c + V for potentials V which are in L°° and not too large. In [51], Dolbeault, 
Esteban and Sere proved that if V satisfies the assumptions 

(49) V(x) |xR?oo 0, 

(50) - ^- - Ki < V < K 2 = sup V{x) , 



(51) Kt,K 2 > 0, K x + K 2 - c 2 < \/c 4 - v 2 c 2 

with v e (0, c), Ki : K 2 € K, then the first eigenvalue \\{V) of D c + V in the 
interval (— c 2 , c 2 ) is given by the formula 

52) Xx(V) = mf sup —— , ip = [ 

Actually, under the conditions ([4^|) - ([50l) -([5T ]l . it can be seen that D c + V has an 
infinite sequence of eigenvalues {\k(V)}k>i converging to 1, and it was proved in 
[51j that each of them can be obtained by a min-max procedure: 
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Theorem 19. (Min-max characterization of the eigenvalues of D c + V [51]). Let 
V be a scalar potential satisfying {^9\j - f5Djl - \51\) . Then, for all k > 1, the k-th 
eigenvalue Xk{V) of the operator D c + V is given by the following min-max formula 

(53) AfeOO= inf sup \ T {V, tp) , 

Y subspacc of C~(K 3 ,C 2 ) V £Y\{0} 

di m y=fc 

where 

(54) A V, y> := sup — — 

is £/ie unique number in (K^ — c 2 , +oo) swc/i f/iai 

(55) A*(V,„) £ M'tr = / t ,( c ,f^^ y) + (c 2 + «0M")*- 

The above result is optimal for Coulomb potentials for which all the cases v G 
(0, c) are included. But note that assumptions (|5"0|) - (|5"Tj) can be replaced by weaker 
ones allowing in particular to treat potentials which have a finite number of isolated 
singularities, even of different signs. We describe some of these extensions at the 
end of this subsection. 

Theorem [JJJ] is a useful tool from a practical point of view in the sense that the 
first eigenvalue (case k — 1) of D c + V can be obtained by a minimization procedure 
over the (bounded-below) nonlinear functional ip i— > \ T (V 7 ip). Higher eigenvalues 
are obtained by the usual Rayleigh-Ritz minimax principle on the same nonlinear 
functional. As we shall see below, this has important consequences from a numerical 
point of view. 

Theorem [TO] is a direct consequence of an abstract theorem proved by Dolbeault, 
Esteban and Sere [51] . providing variational characterizations for the eigenvalues 
of self-adjoint operators in the gaps of their essential spectrum. 

Theorem 20. (Min-max principle for eigenvalues of operators with gaps [ST]) Let 
TL be a Hilbert space and A : D(A) C TL — > TL a self-adjoint operator. We denote 
by T{A) the form-domain of A. Let TL+, TL- be two orthogonal Hilbert subspaces 
ofH such that TL = TL+®TL~ and let A± be the projectors associated with TL±. We 
assume the existence of a subspace of D{A), F, which is dense in D(A) and such 
that : 

(i) F + = A + F and _F_ = A__F are two subspaces of J 7 (A). 

(m) a_ = SUp a ._ eJr _\{o} (X \\x%~ ] < +°° • 

Moreover we define the sequence of min-max levels 

(56) c k (A) = inf sup , k > 1, 

V subspace of F+ a; g (y©F_)\{0} \\ x \\ n 
dim V—k 

and assume that 

(Hi) c i(^-) > a - • 

Then 

Vfc > 1, c k {A) = /Life, 
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where, if b — inf (a ess (A) D (a_,+oo)) G (a_,+oo], denotes the k th eigenvalue 
of A (counted with multiplicity) in the interval (a_, b) if it exists, or fik — b if there 
is no k th eigenvalue. 

As a consequence, b — lim Ck(A) = supck(A) > a_ . 

k — >OC ' 

An important feature of this min-max principle is that the min-max levels do not 
depend on the splitting H = H+(BH- provided assumptions (i), (ii) and (Hi) hold 
true. In practice, one can find many different splittings satisfying these assumptions 
and choose the most convenient one for a given application. 

In order to treat families of operators without checking the assumptions of the 
above theorem for every case, there is a "continuous" version of Theorem [20] in [51] 
which we shall present now. 

Let us start with a self-adjoint operator Aq : D(Aq) C H — ► H. and denote by 
J-'(Aq) the form-domain of Aq. Now, for v in an interval [0, v) we define A v — 
Aq + vW where W is a bounded operator. The operator A v is self-adjoint with 
V(A V ) = V(A ), T(A V ) = T{A ). Let H = H + ®H~ be an orthogonal splitting of 
H, and P+ , P_ the associated projectors, as in Section 1. We assume the existence 
of a subspace of D(Aq), F, dense in D(Ao) and such that: 

(j) F + = P+F and F_ = P_F are two subspaces of ^(Aq); 

(jj) there is a_ el such that for all v S (0, 

(X—,A V X—) 
a u := sup — [j. < o_. 

x-eF_\{o} Has— 

For v S (0, 9), let b v := inf(a ess (A u ) n (a u , +oo)) , and for k > 1, let fi^,v be the 
fc-th eigenvalue of A v in the interval (a v , b v ), counted with multiplicity, if it exists. 
If it does not exist, we simply let fi^.u '■— b v . Our next assumption is 

(jjj) there is a+ > a_ such that for all v € (0, v), pL\ v > a+ . 

Finally, we define the min-max levels 

(57) c k ,v ■= hif sup ^' , k > 1 , 

V subspaco of F + a; G (y©F_)\{0} |F|| H 
dim V—k 

and assume that 
(jw) ci, > a_ . 
Then, we have the 

Theorem 21. ( [5 1 j ) Under conditions (j) to (jv), A v satisfies the assumptions (i) 
to (Hi) of Theorem \20\ for all v € [0, v), and Ck,u = l L k.v > for all k > 1. 

Theorems [2D] and [5T] are very good tools in the study of the point-spectrum 
of Dirac operators D c + vV , where V is a potential which has singularities not 
stronger than c/\x — xq\ (0 < c < 1). Of course, Theorem |2"T1 cannot be directly 
applied to the case of unbounded potentials, but this can actually be done by first 
truncating the potential and then passing to the limit in the truncation parameter, 
as we did in the proof of Theorem IT9l 

Theorem [211 is an easy consequence of the proof of Theorem [20l In contrast, the 
proof of Theorem 1201 is more involved. We sketch now its main steps. 
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Sketch of the proof of Theorem 1201 For E > a and x+ <E F+, let us define 

v- >-> <pe x x+(v-) = ((x++y-),A( x + + y-)) - E \\ x + + y-\\n- 

From assumptions (i) — (ii), N(y~) — \/(a + 1)| \yl \ \ 2 H — (y_, Ay J) is a norm on 

N 

F-. Let F_ be the completion of F- for this norm. Since ||.||, H < N on F-, we 

N 

have F_ c For all x + £ F + , there is an i e f such that A + x = x + . If we 
consider the new variable z_ = ?/_ — A_x, we can define 

ipE,x(z~) ■= (pE,A+x(z- + A_x) = (A(x + Z-),X + z-) - E(x + Z-,X + Z-) . 

Since F is a subspace of -0(^4), ipE,x (hence (fE,x + ) is well-defined and continuous 
for N, uniformly on bounded sets. So, <pe,x + has a unique continuous extension 

JV 

Tp E x on F_, which is continuous for the extended norm N. It is well-known (see 
e.g. |140j ) that there is a unique self-adjoint operator B : D{B) C H- — > W_ such 

iV 

that D(B) is a subspace of F_, and 

Va;_eZ)(B), iV(x_) 2 = (a+ + (x^BxJ). 

Now, Tp E ^ x+ is of class C 2 on F _ and 

£ 2 fe>-) •(?/-, 2/-) = -2(y- > By_)-2E\\y_\\ 2 n 

(58) < -2min(l,J5) JV^-) 2 . 

So Tp E x+ has a unique maximum, at the point y_ = The Euler-Lagrange 

equations associated to this maximization problem are : 

(59) A-Ax+ - (B + E)y_ = . 

The above arguments allow us, for any E > a, to define a map 
Q B : R 

(60) a;+ i-> Qij(a; + ) = sup ^ (ar_) = ^ (i B x + ) 

i_ef_ + 

= (^ - E)x+) + (A-Ax+, (B + E)~ 1 A-Ax+) . 

It is easy to see that Qe is a quadratic form with domain F + C 7Y+ and it is 
monotone nonincreasing in E > a. 

We may also, for E > a given, define the norm ue(x+) — \\x + + Lex+\\ h . We 
consider the completion X of F + for the norm n e and denote by He the extended 
norm. Then, we define another norm on F+ by 



M E {x+) = ^/Qe(x+) + (K E + l)(n E (x+)) 2 

with Ke = max(0, E ^7 Al ^ ) and consider the completion G of F + for the 
norm Me- Finally, we use the monotonicity of the map E i— > Qe and classi- 
cal tools of spectral theory to prove that the fc-th eigenvalue of A in the interval 
(0,inf (a ess (A) n (a, +oo))) is the unique Ek > a such that 

■ t QE h {x + ) 

(61) mf sup — — k - -2 = . 

V subspace of G !E+eV\{0} ( n E k (X+)) 
dim V—k 



24 



MARIA J. ESTEBAN, MATHIEU LEWIN, AND ERIC SERE 



Note that since the above min-maxes correspond to the eigenvalues of the operator 
T E associated to the quadratic form Qe, (|6lj) is actually equivalent to 

\ k (T E <°) = 0. 

and the fact that this inequality defines a unique relies on the monotonicity of 
Q E w.r.t. E. □ 

In the application of Theorem [20] to prove Theorems [19] and [21] various decom- 
positions H = 7i. + (S>7i~ could be considered. One that gives excellent results is 
defined by 



(62) V 



oK 



This decomposition yields optimal results about the point spectrum for some po- 
tentials V. There are cases for which this is not anymore true. For instance, this 
happens when the potential V has "large" positive and negative parts, case which 
is not dealt with in the previous results. 

Recently, Dolbeault, Esteban and Sere [53] have considered the case where a 
potential can give rise to two different types of eigenvalues, not only those ap- 
pearing in Theorems [19] and [21] More precisely, if V satisfies (|49|) , assume that 
it is continuous everywhere except at two finite sets of isolated points, {xf} , 
{x~}, i = !,...!, j = 1, . . . , J, where 



lim V(x) = +oo , lim V(x) \x — x^\ < 



(63) 



lim V{x) = -co , lim V(x) \x — £ • [ > — v 3 , 



with Vi, Vj € (0, c) for all i, j. Under the above assumptions, as above, D c + V has 
a distinguished self-adjoint extension A with domain T>(A) such that 

H^R^C 4 ) c V(A) c # 1/2 (M 3 ,C 4 ). 

The essential spectrum of A is the same as that of D c : 

cr css {A) = (-oo,-c 2 ] U [c 2 ,+oo), 

see [153 [Ml [12! [103]. Finally, V maps V(A) into its dual, since ([49]) and ([63]) 
imply that for all ip € ff 1 / 2 ^ 3 ), Vip S i/~ 1/2 (M 3 ). The decomposition of # 
considered here is related to the positive/negative spectral decomposition of the 
free Dirac operator D c : 

with $)± — P±Sj, where P± are the positive/negative spectral projectors of the free 
Dirac operator D c . 

As above, we assume the existence of a core F (i.e. a subspace of T>(A) which is 
dense for the norm || • II )' sucn that : 

(i) F + = P°F and F_ = P°_F are two subspaces of V(A)\ 



i \ — (a^-, Ax ) 



< +00; 



+ \ + ■ r ( x +< Ax + ) 

(11+) a+ := mf x+eF+ \ {Q} + x+f + > > 
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We consider the two sequences of min-max and max-min levels {A^~ }fc>i and {X k }k>i 
denned by 



(64) A+ := inf sup 

V subspacc of F + x& (V©F_)\{0} 
dim V—k 



(x, Ax) 



(x, Ax) 



(65) A fe := sup inf 

V subspacc of F_ X 6(V0F + )\{O} ||x 112 



dim V—k 



Theorem 22. Take a positive integer k and any k > fc and let A be the 
self-adjoint extension of D c + V defined above, where V is a scalar potential satis- 
fying IJty an d (SW - 

If a" < AjJ" < c 2 , then Aj is either an eigenvalue of D c + V in the interval 
(a~ ,c 2 ), or X^ = c 2 . If additionally V > 0, then a~ — c 2 and Xt — c 2 ■ 

If —c 2 < A^ < a + , then X^ is either an eigenvalue of D c + V in the interval 
(— c 2 ,a + ) or X^ = —c 2 . If additionally V < 0, then a + = —c 2 and X^ = — c 2 . 

The sequences {A^"}fc>i and {A^}fc>i are respectively nondecreasing and nonin- 
creasing. As a consequence of their definitions we have: 

(66) for all k > 1, A^ > max {a~,a + } and X^ < mm{a~ ,a + } , 

and if a - > a + , we do not state anything about the possible eigenvalues in the 
interval [a + ,a~]. Note that, as it is showed in [53], there are operators for which 
all or almost all the eigenvalues lie in the interval [a + , a - ] and thus, they are not 
given by the variational procedures defining the At. 's. 

Finally, let us remark that if we apply Theorem [2U to deal with a family of 
operators D c + rV, with V satisfying (j49|) - ([63|) , then we see that the eigenvalues 



A^ 's and A fe 's are of a "different" kind , since 

lim XT'* = ±c 2 , for all k > 1 . 

In physical words, we could say that the A^"'s correspond to electronic states, and 
the A^~'s to positronic ones. 

2.2. Numerical method based on the min-max formula. Let us now come 
back to the case k = 1 of Theorem \T§\ Note that from (|52p and (|54[) we see that 
(under the right assumptions on V) the first eigenvalue Xi(V) of D c + V in the 
gap (— c 2 , c 2 ) can be seen as the solution of a minimization problem, that is, 

(67) X 1 (V)= min A T (V» , 

v ec°°(R 3 .c 2 ) 

where <p >— > X T (V, ip) is a nonlinear functional implicitly defined by 

(68) X T (V, <p) [ M 2 dx=/ ( 2 C2 \ ( f: V Jf / ) + (c 2 + VM 2 )dx. 

V J R s\c 2 - V + X 1 {V,(p) J 

The idea of characterizing the first eigenvalue in a gap of the essential spectrum as 
the solution of a minimization problem is not completely new. It has for instance 
already been used by Bayliss and Peel [16] in another context. It is also close to 
the Fesbach method and to some techniques used in Pencil Theories. 

The fact that one can reduce the computation of Ai(V) to that of a minimization 
problem ([6T)) - ([S5)) has an important practical consequence: these problems ([6T| - (p5)l 
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can now be easily discretized to construct an algorithm, allowing us to approximate 
Ai(V) in an efficient manner. Indeed, the functional X T (V, •) to be minimized is 
bounded from below in the whole space i? 1 / 2 (R 3 ,C 2 ) . This is a huge advantage 
compared to other methods in which the total Rayleigh quotient Q v is minimized 
on particular finite dimensional subspaces of i? 1 / 2 (R 3 , C 4 ) : the latter are prone 
to variational collapse (entering into the negative continuum) and can even furnish 
spurious solutions (see, e.g. [55]). 

The discretization method based on (f67|) is completely free of all these complica- 
tions and satisfactory numerical tests for atomic and molecular models have been 
presented in j54[ [52] ■ Notice that molecular simulations are more complicated to 
carry on because one cannot use the rotational symmetry like in the atomic case. 
In contrast to the one-dimensional radially symmetric problem, the discretization 
has to be made in R 2 when axial symmetry is present, or in R 3 in the general 
case. Below we describe the algorithm that was used in [54j [52] to find eigenvalues 
of D c + V by the minimization method presented above. 

Consider the following approximation procedure for Xk{V), k > 1. Take any 
complete countable basis set B in the space of admissible 2-spinors X and let B n 
be an n-dimensional subset of B generating the space X n . We assume that B n 
is monotone increasing in the sense that if n < n', then B n is contained in B n i. 
Denote by ipi, ip%, . . . , tp n the elements of B n . For all 1 < i,j < n, we define the 
n x n matrix A n (X) whose entries are 

(69) Alf (A) = I ( ^^-V^ + (y + * - A) ) ^ 

The matrix A n {\) is selfadjoint and has therefore n real eigenvalues. For 1 < k < n, 
we compute Xk, n as the solution of the equation 

(70) /Xfc,„(A) = , 

where jUfe, n (A) is the fc-th eigenvalue of A n (\). Note that the uniqueness of such 
a A comes from the monotonicity of the r.h.s. of equation (|69[) with respect to A. 
Moreover, since for a fixed A 

(71) Mfc,n(A) \ MfePO as n -* +°° i 
we also have 

(72) A fc , n \Afc(F) as n -> +oo . 

The elements of the basis set used in [51] were Hermite functions. In [55] more 
efficient numerical results have been obtained by means of _B-spline functions. The 
interest of using well-localized basis set functions is the sparseness and the nice 
structure of the corresponding discretized matrix A n (X). If the degree of the basis 
of B-splines increases, the number of filled diagonals will also increase. So, a good 
balance has to be found between the smoothness of elements of the approximating 
basis set and the speed of the corresponding numerical computations. In [55] the 
simple choice of considering second order spline functions on a variable length grid 
was made. In the atomic case, when 1-dimensional .B-splines are used, very quick 
and accurate results can be obtained. In [55] numerical tests were provided for 
some axially symmetric diatomic molecules. 

In [109j we can find an algorithm which has some analogy with the algorithm 
described above. 
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2.3. New Hardy-like inequalities. Another byproduct of the minimization char- 
acterization of the first eigenvalue of D c + V given in |67|) and of (|55|) is the 
following: for all tp £ D(D C + V), 

In the particular case V — —v/\x\, v 6 (0, c) , (|73|) means that for all 99 <E 
(74) 

/ ( *M^L= + i<?-y/*=M)\vAdx>v[ ^rd X . 

h* \<? + v/\x\ + y/<*-v*<? /m j - 7 R3 

By scaling, one finds that for all tp € i/ 1 (R 3 ,C 2 ), and for all v G (0, 1), 

and, passing to the limit when v tends to 1, we get: 



Jr 3 V 1 + Vpl / Jm. 3 FI 

This inequality is a Hardy-like inequality related to the Dirac operator. It is not 
invariant under dilation, which corresponds to the fact that the Dirac operator D c 
is not homogeneous. But by another scaling argument, ([76]) yields, as a corollary, 
an inequality which is invariant by dilation, 

(77) / \x\\(a-V)p\ 2 dx> [ ^dx, 

JR3 JR3 Fl 

which is actually equivalent to the "classical" Hardy inequality 



(78) 



Jw 4Ji3 \x\ 2 



For a 4-dimensional version of (|77|) , see [1701 19"9"] . 

Finally, note that in [13] the Hardy-like inequality (176[) (and slightly more gen- 
eral ones also) has been proved by analytical means, without using any previous 
knowledge about the Coulomb-Dirac operator's spectrum. 

2.4. The nonrelativistic limit. Let us now indicate how we can relate the eigen- 
values of the Dirac operator to those of the Schrodinger operator. This relation is 
established by taking the limit c — > +00 , so by passing to the nonrelativistic limit. 
A ip with values in C 4 satisfies the eigenvalue equation 

(79) (D C + V)^ = X^ 

if and only if, writing tp = (^) with <p, \ taking values in C 

(80) 



with 



f vv mm \ I . M . I \ I I I V V i I I I \\ Ml 

R cX = (\-c 2 -V) <p, 
R c p = (X + c 2 ~V) X , 



3 Q 
R c = -ic(a.V) = - iccr 3 g^T- 
3=1 3 
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Recall that aj, j = 1,2,3, are the Pauli matrices. As long as A+c 2 — V^O, the 
system (|80|) can be written as 



(81) R c (^) + Vp = w, X =^ 

where g = fi + 2c 2 — V and \i — A — c 2 . 

Assume now that ip c — ( (/ ' c ) is an eigenfunction of the operator (D c + V) 
associated with the eigenvalue A c which satisfies 

liminf(A c — c 2 ) > — oo, limsup(A c — (?) < 0. 

c >oo c — 

The system (I5T1) can be written as 

(82) Mc + 2c 2 -^ + (M c + 2c 2 -y) 2 + V ^ = ^' X ° = » c + 2( ?-V > 

with /i c = A c — c 2 . It is then easy to prove (see [65]) that for c large, the functions 
(p c , which in principle are only in i? 1 / 2 (IR 3 , C 2 ) , actually belong to the space 
H 1 (M 3 ,C 2 ) and are uniformly bounded for the 7? 1 (1R 3 ,C 2 ) norm. Moreover, after 
taking subsequences, we can find <p G iJ 1 (R 3 ,C 2 ) and j2 < such that 

lim \\(p c - <p\\ni(s.so>) = 0, lim fi c = p,, 

and 

— — + Vip = Hf. 

Note that Xc the lower component of the eigenfunction ip c , converges to in the 
nonrelativistic limit. 

It can be proved that for all the potentials V considered in the theorems of this 
section, all the eigenvalues in the gap (— c 2 ,c 2 ) satisfy the above conditions, and 
converge, when shifted by the quantity — c 2 , to the associated eigenvalues of the 
Schrodinger operator — ^ perturbed by the same potential V. 

2.5. Introduction of a constant external magnetic field. The previous results 
were devoted to the case of a scalar electrostatic field V. The Dirac operator for 
a hydrogenic atom interacting with a constant magnetic field in the a^-direction is 
given by 



(83) D B := a ■ 



B 

-IV + —{-X2,Xi, 0) 



where v = Za > 0, Z being the nuclear charge number (we fix the speed of light 
c = 1 in this subsection) and B is a constant. 

The magnetic Dirac operator without the Coulomb potential v/\x\ has essential 
spectrum (— oo, —1] U [l,oo) and no eigenvalue in the gap (—1, 1) for any 6 £ 1. 
The operator D B has the same essential spectrum and possibly some eigenvalues 
in the gap. The ground state energy Ai (v, B) is the smallest among these. As 
the field gets large enough, one expects that the ground state energy of the Dirac 
operator decreases and eventually penetrates the lower continuum. The implication 
of this for a second quantized model is that electron-positron pair creation comes 
into the picture |129|, 1132] . The intuition comes from the Pauli equation, where 
the magnetic field tends to lower the energy because of the spin. It is therefore 
reasonable to define the critical field strength B{i>) as the supremum of the -B's for 
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which Ai(i/, b) is in the gap (—1, 1) for all b < B. As a function of v, \\{v,B) is 
non-increasing, and as a result the function B(y) is also non-increasing. Estimates 
on this critical field as a function of the nuclear charge v can be found in . They 
have been obtained by adapting to this case the variational arguments of Theorem 
[T9l One of the first results in this paper states that for all v £ (0, 1), 

0.75 „. , . / 18tw 2 alu 

(84) — < B{y) < ^(——^eS/v 

As a corollary we see that as v — ► 1 the critical field B(v) stays strictly positive. 
This is somewhat remarkable, since in the case without magnetic field the ground 
state energy \\(y, 0), as a function of v tends to as v — > 1 but with an infinite 
slope. Thus, one might expect very large variations of the eigenvalue at v = 1 as 
the magnetic field is turned on, in particular one might naively expect that the 
ground state energy leaves the gap for small fields B. This is not the case. 

Next, again by using the min-max characterization of \\{y, B\ it is shown in 
[15] that for v > small enough, and B not too large B) is asymptotically 

close to the ground state energy of the Coulomb-Dirac magnetic operator in the 
lowest relativistic Landau level co(^, B). This constant is proved to be given by 

(85) cq(v,B)= inf A£(/), 

where Ajf (/) is implicitly defined by 
(86) 

A B (/) / + °° \f(z)\ 2 dz= Jfjf^ B( > + (1 ~ va*{z)) \f{z)A dz, 

J-oo J-oo V 1 + A (/) + Va K z ) J 

and 

r + °° se-T- 

a£(z) = B / ds. 
° K ' Jo Vs^Tz^ 

In [48] it is proved that for B not too small and v small enough, 

(87) c (v + is 3/2 ,B) < M{v,B) < c { V ,B). 

and that since for v small, v z / 2 « u, 

c Q (y + i/ 3/2 , B) - c Q (v,B) - \i{v,B) as v^O. 

The one dimensional cq(v, B) problem, while not trivial, is simpler to calculate 
than the \\{v, B) problem. As a result, in the limit as v — > 0, this new theory yields 
the first term in the asymptotics of the logarithm of the critical field. In particular 
we have the following result, 

lim v\og(B(v)) = tt 

v— *0 

3. The Dirac-Fock equations for atoms and molecules 

In the previous two sections we described some results concerning the solutions of 
nonlinear or linear Dirac equations in R 3 , which represent the state of one electron 
only (or possibly many non-interacting electrons). We now want to present the 
Dirac-Fock (DF) model which allows to describe the state of interacting electrons, 
like for instance N electrons in a molecule. The DF model is very often used in 
quantum chemistry computations and usually gives very good numbers when the 
correlation between the electrons is negligible. It is the relativistic counterpart of 



30 



MARIA J. ESTEBAN, MATHIEU LEWIN, AND ERIC SERE 



the better known non-relativistic Hartree-Fock equations, which can indeed be seen 
as the non-relativistic limit (c — > oo) of the Dirac-Fock model as explained below. 
For this reason, we start by recalling briefly the Hartree-Fock model. 

3.1. The (non-relativistic) Hartree-Fock equations. The Hartree-Fock equa- 
tions are easily derived from the linear Schrodinger model in which one considers 
the following operator 

i=l v 7 l<i<j<N 1 n 

whose associated quadratic form describes the energy of N interacting electrons in 
the potential field V . Most often, V is the Coulomb electrostatic potential created 
by a positive distribution of charge v > 0, of total charge Z: 

V = — V*-, — r, / V = Z. 



In the case of M pointwise nuclei of charges z%, zm and located at x\ 1 ...,Xm, 
one takes v = X)m=i z m5x rrl and Z = 53m=i Zm ' extended nuclei can also be 
considered in which case v is assumed to be a smooth L 1 non- negative function. 

Due to the Pauli principle, the operator H acts on Aili £ 2 (K 3 x {±},C), that 
is to say the space of L 2 functions ^(a;i, oi..., Xn, crjy) which are antisymmetric 
with respect to the permutations of the (x^, <Tj)'s. When Z > N — 1, it is known 
[17H 1172) that the spectrum of H has the form a(H) = {Xi} U [£, oo) where {A^} 
is an increasing sequence of eigenvalues with finite multiplicity converging to the 
bottom of the essential spectrum S. We notice that the condition Z > N — 1 plays 
a special role even for the linear theory based on the operator ([55)) . as one knows 
[1651 196j that only finitely many eigenvalues exist below E when N > Z + 1, and 
that there is no eigenvalue below E when N > Z [1441 1151[ 11521 I115J . In the 
following, we shall always assume that Z > N — 1. 

In the Hartree-Fock approximation, one computes an approximation of the first 
eigenvalue Ai of H by restricting the quadratic form VP i— > (\&, H^) to the class of 
the functions \& which are a simple (Slater) determinant: 

(89) * = ipi A ■ ■ ■ A ip N 

where (ipi, y>jv) is an orthonormal system of L 2 (R 3 x {±},C) = L 2 (M 3 ,C 2 ), 
J M3 (<Pi, <fj)c 2 = Denoting tp^ — f^-^j 1HS1) means more precisely 

*(xi,t7i, ...,x n ,cfn) = -^=det(<Pi 3 (xj)). 

Since the set of all the ^/'s having the form |89[) is not a vector subspace of 
/\ i=1 i 2 (R 3 x {±},C), one then obtains an energy functional which is nonlinear 
in terms of (pi, <pN- The associated Euler-Lagrange equations form a system of 
N coupled nonlinear PDEs: 

(90) H< bV > k = \ k i Pk , k = l,...,N 

where i/$ is the so-called mean-field operator seen by each of the N electrons 

(91) ^ = __ + ( ^_ J/ ), ______ 
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with p$ being the (scalar) electronic density and 7$ the so-called density matrix of 
the iV electrons (this is a 2 x 2 matrix for any (x, y) £ R 3 x E 3 ): 

JV N 

(92) p $:= ^|(^| 2 and j$(x,y) :=y]ipj(x) ® <fij(y)*. 

We notice that ([9Tj) means 

for any tp e H 2 (R 3 ,C 2 ). The existence of solutions to ([90]) when J* R3 z/ = Z > N- 1 
has been proved first by Lieb and Simon [117) by a minimization method, and then 
by Lions [119] by general min-max arguments. See also [108j for a recent survey. 



3.2. Existence of solutions to the Dirac-Fock equations. The relativistic 
Dirac-Fock equations were first introduced by Swirles in [156j . They take the same 
form as the Hartree-Fock equations (j90[) . with — A/2 replaced by the Dirac operator 
D c . They are of course posed for functions taking values in C 4 instead of C 2 . Note 
however that when — A/2 is replaced by D c in the formula of the iV-body Hamil- 
tonian (|88p , one obtains an operator whose spectrum is the whole line R as soon as 
N > 2. To our knowledge, it is not known whether there exist or not eigenvalues 
which are embedded in the essential spectrum. In any case, the relativistic TV-body 
problem is not well-posed. This somehow restricts the physical interpretation of 
the Dirac-Fock model, compared to its non-relativistic counterpart. We refer to 
the next section in which a better model deduced from Quantum Electrodynam- 
ics is presented. Despite this issue, the Dirac-Fock equations have been widely 
used in computational atomic physics and quantum chemistry to study atoms and 
molecules involving heavy nuclei, and they seem to provide very good results when 
the correlation between the electrons is negligible. 

In the case of N electrons, the Dirac-Fock equations read 

(93) D c,<s><p k =e k (fk, k = l,...,N, 
where $ = (ipi, . . . , ipjf) satisfies J R3 (ifi(x), <fij{x)) dx — Sij, i.e. 

(94) Gram($) = 1, 
and 

(95) D c ,„ = D c + {p,s,-v)*±-~ l^lA , 

\x\ \x-y\ 



N N 



(96) l$(x,y) =^2<p t {z) ® ¥> e {y)*, P*(x) = tr ci (^(x,x)) = Y^lfA^W 



1=1 



Notice that 7$ (x, y) is a 4 x 4 complex matrix, and that the operator whose kernel 
is 7$ (a;, y), is nothing but the orthogonal projector onto the space spanned by 
ifi, <pn- We also denote it by 7$. 
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Indeed, like for the Hartree-Fock case, equations (|93|) are the Euler-Lagrange 
equations of the Dirac-Fock functional 



N N , 



(97) = Y,(v» D 'H>) -E(^' ["* i 



1=1 h e=i 



\x - 2/| 



on the manifold 



M := {$ = (<?!,. ..,^at) G (H 1 / 2 ^ 3 ^ 4 )^, Gram($) = l}. 

It will be important to notice that the functional only depends on the projector 
7$ defined in 



(98) £-(*) = "tr «D a + V)7*) " + 5 // f^^ip^d* * , 



! xR 3 

N 



where by "tr ((D c + V)^<s>) " we denote Y^((£) c + V)^j, <^j). Note that this expres- 

i=l 

sion is really a trace if the (p^s are in H 1 (M 3 , C 4 ). 

As a matter of fact, the Euler-Lagrange equations of on M. only depend on 
the space spanned by (<p\, ...,ipn). This explains why, up to a rotation of the ipi's, 
one can always assume that the Lagrange multiplier matrix associated with the 
constraint (|94|) is diagonal, with diagonal elements (ei, ...,£n) appearing in (f93l) . 



Finding solutions of (f9"3"| is then reduced to finding critical points of the func- 
tional on the manifold M. Once again, the unboundedness (from above and 
below) of the spectrum of the free Dirac operator makes the functional £^p totally 
indefinite. This together with the a priori lack of compactness of the problem posed 
in R 3 and the fact that we have to work on a manifold M. and not in the whole 
functional space, makes the variational problem difficult. A minimization proce- 
dure is once again impossible and another method has to be found. In |64j , Esteban 
and Sere defined a penalized variational problem (see below for details) which can 
be solved by first maximizing on some part of the spinor functions ifi and then 
defining a more standard min-max argument for the remaining functional, together 
with Morse index considerations. 

The theorem proved in [M] and improved later by Paturel in [131] states the 
following: 

Theorem 23. (Existence of solutions to the Dirac-Fock equations [6411131] ) With 
the above notations, assume that N and Z = J R3 v are two positive integers satisfy- 
ing max(Z, N) < an< ^ N — 1 < Z. Then, there exists an infinite sequence 
(<& c ' J ) j>0 of critical points of the Dirac-Fock functional on M. The functions 
(pi , . . . (p^ satisfy the normalization constraints {94\j and they are strong solu- 
tions, in H 1 / 2 (M 3 ,C 4 ) H rii< g <3/2 W^iM 3 ,^), of the Dirac-Fock equations 

(99) D c ^ 3 tf* = & <{& , 1 < k < N , 
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(100) < e* 3 < ... < < c 2 . 

Moreover, 

(101) < £d'f($ cj ) < Nc 2 , 

(102) lim EZ&ip **) = Nc 2 . 

Remark 24. In our units, taking into account the physical value of the speed of 
light c, the above conditions become 

N < Z < 124. 

The proof of the above theorem is done by defining a sequence of min-max 
principles providing critical points of increasing Morse index. We notice that the 
solution <£> c, ° obtained by Theorem [221 when j = will play an important role, 
since for c large it will be actually interpreted as an "electronic ground state" (see 
below) . 

The condition Z,N < 2c/(2ir + ir/2) is not that unnatural, since already in the 
linear case such a condition was necessary to use Hardy-like inequalities ensuring 
the existence of a gap of the spectrum of D c + V around 0, see (|45|) . Notice however 
that in [64j the following additional technical assumption was used: 

(103) 3JV- 1< , 2C fW ■ 
v ' 7:12 + 2/11 

This assumption was removed by Paturel shortly afterwards in [131] . He did so 
by studying a finite dimensional reduction of the problem and then passing to the 
limit, after having obtained the necessary bounds. This proof was done in the spirit 
of the Conley-Zehnder proof of the Arnold's conjecture [41]. The sketch of the proof 
that we give below is that of [64] because the variational arguments are easier to 
explain in that case. 

Sketch of the proof of Theorem l23l when (I103|) holds. The first (and smallest) 
difficulty here is that v*-ri is not a compact perturbation of D c when v — ZSq. This 
creates some technical problems. They are easily solved, replacing the Coulomb 
potential jij by a regularized potential Vs. The modified energy functional is 
denoted now by £^p V - At the end of the proof, we shall be able to pass to the 
limit rj — > 0. 

The second difficulty is that the Morse index estimates can only give upper 
bounds on the multipliers e c k 3 in ([55]) . But we also want to ensure that e^ 3 > 0, 
since these multipliers are interpreted as the energies of the different electrons. To 
overcome this problem, we replace the constraint Gram ($) = 1 by a penalization 
term 7r p ($), subtracted from the energy functional: 

(104) tt p ($) = tr (Gram (l - Gram $) 

In this way we obtain a new functional J- c ,r/,p = ^df" 7 — > defined now on the set 
of <f>'s satisfying 

< Gram $ < 1 . 

Since in the basis in which 

Gram $ = Diag(ai, . . . , &n) , < o\ < ■ ■ ■ < ajy < c 2 , 
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the matrix ^Gram $J ^1 — Gram <t>J is also diagonal and equals 

x p 

Dia g(/ P (o-i): • • • , ,/pOaO) , with f p {x) := , 

1 — x 

the corresponding Euler-Lagrange equations are then 

The numbers e^ 3 := /p(c^ J ) are now explicit functions of tp\f ■ Thus, f p being an 
increasing function, we automatically get el > 0. 

The third difficulty with DF, is that all critical points have an infinite Morse 
index. This kind of problem is often encountered in the theory of Hamiltonian 
systems and in certain elliptic PDEs. One way of dealing with it is to use a concavity 
property of the functional, to get rid of the "negative directions" , see e.g. j2l [23 
1551 |3"T] . This method was used in 64J . Doing so, we get a reduced functional 
I c ,r],p- A min-max argument gives us Palais-Smale sequences ($^' ) ' p )n>i for I c ,r),p 
with a Morse index "at most j" (up to an error which converges to as n — > oo), 
thanks to [551(77]. Moreover, adapting the arguments of [1 19] . we prove that the 
corresponding £ ^- rl -P- n are bounded away from c 2 . Finally, the assumptions made 
on Z, N guarantee that the £ c ^ 3 ' rhV > n are also bounded away from 0, uniformly on 
r], p and n. Then we pass to the limit (rj,p,n) — > (0,oo,oo), and get the desired 
solutions of DF, with < < c 2 . 

The fact that we recover at the limit p — > +oo the constraint Gram $ = 1 is a 
consequence of the a priori estimates 

Q<e<e?™ = 

with e independent of rj.p. The properties of the function f p and the above in- 
equality imply that as p — > +oo , one necessarily has o~ c,J '' n ' p — > f , which of course 
is equivalent to saying that in the limit there is no loss of charge: L 3 |<^ J | 2 = 1 • 

The method that we have just described can be generalized. In [56] an abstract 
version is provided, with applications to nonlinear periodic Schrodinger models 
arising in the physics of crystalline matter. 

The concavity argument of [64] works only if (|103[) holds. In his theorem, Paturel 
[131] got rid of this assumption my making a finite dimensional reduction first. This 
allowed him to deal with finite Morse indices again. □ 

3.3. Nonrelativistic limit and definition of the Dirac-Fock "ground state". 

As in the case of linear Dirac equations of Section 3, it is interesting here to see 
what is the nonrelativistic limit of the Dirac-Fock equations. We shall recover in 
the limit the Hartree-Fock equations ([50]) presented above, for the two-dimensional 
upper component of the (fi's (recall that <pi,...,<pit are C 4 -valued functions), the 
lower component converging to zero. This was proved rigorously in [65] . This 
result has been of importance to better understand the variational structure of the 
Dirac-Fock problem and in particular to obtain a good definition of an electronic 
ground-state energy, which is a priori not clear because of the unboundedness of 
the Dirac-Fock energy. 

Theorem 25. (Non-relativistic limit of the Dirac-Fock equations [65]) Let be N < 
Z+l. Consider a sequence of numbers c n — > +oo and a sequence ($")„ of solutions 
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of nm, i.e. $ n = (pj, eocA^ feeing in F 1 / 2 ^ 3 , C 4 ), 

and D Cri ^n(p% = ejjv'fc ■ Assume that the multipliers ejj, fc = 1, 



wii/i Gram $ n = 1 
. . . ,N, satisfy 



< £ < £? < 



^jy ^ u — £', with e,e' > independent of n. 

Then for n large enough, each tp^: is in iJ 1 (M 3 ,C 4 ) ; and i/iere exists a solution of 
the Hartree-Fock equations (|90p . $ = (<^i,-*- , <^jv) G -ff^lR 3 , C 2 )^, with negative 
multipliers, Ai, Ajv, swc/i £/iai, a/ier extraction of a subsequence, 

(105) lim (^ - (c„) 2 ) = A fc> 



(106) ^ = 
(107) 

for all k = 1, 
(108) 



(3) 
















* / 





in i^M^C 2 ) x if 1 (I 



2 ). 



i 2 {ffi 3 ,C 2 ) 



... TV. and 



Net 



0(l/(c n 



The Hartree-Fock energy £hf appearing in (| 1 08|) is the same as (|97|) . but with 
D c replaced by —A/2 and C 4 by C 2 . It can be proved [65] that the critical points 
constructed in Theorem [23] all satisfy the assumptions of Theorem [25] for any fixed 
j. Therefore, all the <I >CJ converge as c — > oo to a state whose upper component is a 
solution of the Hartree-Fock equations, and whose lower component vanishes. This 
result can even be made more precise in the case of one of the "first" solutions (i.e. 
corresponding to j = 0): the critical point <!> c ' does not converge to any solution 
of the HF equations, but actually to a Hartree-Fock ground state, as stated in the 
following 

Theorem 26. (Non-relativistic limit of the Dirac-Fock "ground state" [65]) As- 
sume that N < Z + 1 and v are fixed, and that c n — > oo . Then the critical point 
<£>c„,o constructed in Theorem ] 23\ for i — satisfies 

lim {£^(^°)-Ncl} 



Up to a subsequence, ($ c "'°) converges as c n 
of £hf • 

Furthermore, for c n large enough, the £ c k 
(109) 



mm 

Gram <I>=1 



£hf($). 



oo to 



where $o is a minimizer 



are the first positive eigenvalues of 



7* c 



X[o,, 



We notice that (|109[) means that the last level £^'° is necessarily totally filled. 
In other words, similarly to the Hartree-Fock case [7], "there are no unfilled shell 
in the Dirac-Fock theory for c^S> 1" . 

Although the Dirac-Fock functional £^p is not bounded-below, Theorem [25] al- 
lows to interpret the first min-max solution $ c, ° (any of them, since there is no 
uniqueness) as an electronic ground state, since it converges to a Hartree-Fock 
ground state in the nonrelativistic limit. Actually, more has been proved in [65j : 
<jjc,o m( jeed minimizes the Dirac-Fock functional among all the <E> = (ipi, ipjy) 
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Figure 1. For c> 1, the Dirac-Fock 'ground state' $ c, ° contains 
the eigenfunctions associated with the N first positive eigenvalues 
of the mean-field operator D c ^,o. 

such that each ifi belongs to the positive spectral subspace of the mean-field oper- 
ator D c $: 

Theorem 27. (Variational interpretation of the Dirac-Fock "ground state" [65] ) 
Assume that N < Z + 1 and v are fixed, and that c is sufficiently large. Then <I> C:0 
is a solution of the following minimization problem: 

(110) inf | $= ((p u ...,<p N ), Gram$ = l, X(-oo,o)(^a,») * = 0} 

where X(-oo,o)(^c,*) denotes the negative spectral projector of the operator D c ^>, 
and 

X(-oo,0) (D c ,<s>) $ = 

means 

X(-oo,o) (-D c ,#) ¥k = 0, 

for any k = 1, N. 

The interpretation of the theorem is the following: although the Dirac-Fock 
energy is unbounded from below, the critical points $ c '° are, for c large enough, 
the minimizers of £pp on the set of functions satisfying the nonlinear condition 
X(-oo,o) (-Dc,*) $ = 0. Calling these functions electronic, we obtain that for c large, 
$ c '° is really an electronic ground state. In particular, it is also a minimizer of £^p 
among all the solutions of Dirac-Fock equations with positive Lagrange multipliers. 

Remark 28. Recently, explicit bounds on c under which Theorem \27\ is valid have 
been provided by Ruber and Siedentop [94j . 

4. The mean-field approximation in Quantum Electrodynamics 

In this last section, we want to present some progress that has been made re- 
cently concerning models from Quantum Electrodynamics (QED) in which, instead 
of trying to 'avoid' the negative spectrum of D c , the latter is reinterpreted as Dirac's 
vacuum and completely 'incorporated' into the model. The somehow surprising con- 
sequence will be that, contrary to the previous sections, the energy functional will 
be bounded from below. The price to pay is that one has to deal with infinitely many 
particles (many of them being 'virtual') instead of the finite number of electrons as 
previously. 

As mentioned in the introduction, Dirac interpreted the negative spectrum of 
his operator D c as follows [351 H51 [J7] . Since there exists no electron of negative 
kinetic energy, one has to find a way of avoiding the negative spectrum. This is 
done by assuming that the negative spectrum energies are all occupied by virtual 
electrons, one in each energy state, and that this (virtual) distribution of charge is 
not felt by the real particles on account on its uniformity. Then, the real electrons 
can in general only have positive energies due to the Pauli principle which prevents 
them to be in the same state as a virtual particle. 
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Mathematically, Dirac's postulate is interpreted as follows: the states of the real 
free electrons necessarily belong to the positive spectral subspace ff + ^ c — P+ c Sj 
defined in Proposition [TJ and the vacuum (Dirac sea) is described by the infinite 
rank spectral projector P°_ c . Recall that in Hartree or Dirac-Fock theories, the 
density matrix 7$ of the N electron state ^ = <p\ A • • • A (fiN is precisely the orthog- 
onal projector onto span(^i, (Pn)> see, e.g. formulas (189]) and ([92]) . Therefore, 
when one represents the Dirac sea by the projector P°_ c , one describes formally the 
vacuum as an infinite Slater determinant 

(111) 0° = ipi A — Aifi A--- 

where (ifii)i>i is an orthonormal basis of Sj a _ c = P° c fj. Dirac's postulate is based 
on the important invariance by translation of the projector P^_ c , which is used to 
neglect the physical influence of this 'constant background'. 

It was realized just after Dirac's discovery that, for consistency of the theory 
the vacuum should not be considered as a totally virtual physical object which 
does not interact with the real particles. Dirac himself [45,|46l[47j conjectured the 
existence of surprising physical effects as a consequence of his theory, which were 
then experimentally confirmed. First, the virtual electrons of the Dirac sea can 
feel an external field and they will react to this field accordingly, i.e. the vacuum 
will become polarized. This polarization is then felt by the real particles and one 
therefore is led to consider a coupled system 'Dirac sea + real particles'. From 
the experimental viewpoint, vacuum polarization plays a rather small role for the 
calculation of the Lamb shift of hydrogen but it is important for high-Z atoms [126] 
and it is even a crucial physical effect for muonic atoms [751 [75]. Second, in the 
presence of strong external fields, the vacuum could react so importantly that an 
electron-positron pair can be spontaneously created [1291 11411 11421 1143] . 

The mathematical difficulties of a model aiming at describing both the Dirac 
sea and the real particles are important, for one has to deal at the same time 
with infinitely many particles (the real ones and the virtual ones of the Dirac 
sea). In the following, we present a Hartree- Fock (mean-field) type model for this 
problem, which has been mathematically studied by Hainzl, Lewin, Sere and Solovej 
83, 84, 85, 86J. The model under consideration is inspired of an important physical 
article by Chaix and Iracane [36j in which the possibility that a bounded-below 
energy could be obtained by adding vacuum polarization was first proposed. But 
the equations of this so-called Bogoliubov-Dirac-Fock model were already known 
in Quantum Electrodynamics (QED) [142j . For the sake of simplicity, we shall 
not explain how the model is derived from the QED Hamiltonian and we refer to 
[83] 184] 185] [86] and the review [87]. In the version studied in these works, the 
electromagnetic field is not quantized (photons are not considered). 

Let us now describe formally the mean- field approximation in QED, following 
mainly [86] [87] . The state of our system will be represented by an infinite rank 
projector P. This projector should be seen as the density matrix of an infinite 
Slater determinant made of an orthonormal basis of the subspace PSj, like (|llip . 
The projector P describes not only Dirac's vacuum but the whole system consisting 
of the infinitely many virtual particles of the vacuum, together with the finitely 
many real particles (they could be electrons or positrons). Indeed, it is important 
to realize that in this model there is no a priori possible distinction between the real 
and the virtual particles. For N electrons, this would correspond to a decomposition 
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of the form P = P vac + 7 where 7 is an orthogonal projector of rank N satisfying 
Pvac 7 = 7-Pvac = (for N positrons, this becomes P = P vac — 7)- But there are 
infinitely many such decompositions for a given P and a given N: it will only be for 
the final solution of our equation that this decomposition will be done in a natural 
way. 

For the sake of simplicity, we take c = 1 except at the very end of this section. 
We recall that in this case an additional parameter a — e 2 appears, where e is the 
charge of the electron. The energy in the state P of our system in the presence of 
an external field V = —av * | • l^ 1 is then formally given by [86] 

(112) S v (P-l/2) 

where £ v is the Dirac-Fock functional expressed in term of the density matrix, see 
(EH]), 



(113) £ V {T) := tr(£>ir) - a (v * y-^j {x)pr{x) dx 



2 J jr 3 xr 3 \x-y\ 2JJ R 3 XR3 \x-y\ 

The density of charge v which creates the field V can for instance represent a system 
of nuclei in a molecule. But in the following we shall not allow pointwise nuclei as 
we did for the Dirac-Fock model, and v will essentially be an L\ oc function. This is 
not important for pointwise particles do not exist in nature. 

The subtraction of half the identity in (| 1 1 2[) is a kind of renormalization which 
was introduced by Heisenberg [89] and has been widely used by Schwinger (see [1471 
Eq. (1.14)], [UHl Eq. (1.69)] and [1491 Eq. (2.3)]) as a necessity for a covariant 
formulation of QED. The importance of this renormalization will be clarified below. 

Before we come to the problems of definition of the above energy (which are 
numerous), let us mention briefly how £qed * s supposed to be used. There are 
two possibilities. If one wants to find the state of the vacuum alone in the field 
V (no particle at all), then one has to minimize P 1— > £ v [P — 1/2) on the whole 
set of orthogonal projectors. When v — 0, one should obtain the free vacuum, 
a translation-invariant projector which is supposed to be physically unimportant. 
When v 7^ 0, one should obtain the 'polarized vacuum in the presence of the field 
V. It can formally be seen that such a minimizer P should be a solution of the 
following nonlinear equation: 

(114) P = Xi-ooM (A + *(p [P -i m " ") * p| " ^'^y^ ) ■ 

If one wants to describe a system containing TV real particles (or more correctly of 
total charge —eN), then one has to minimize P 1— > £ V (P— 1/2) under the additional 
constraint tr(P — 1/2) = N. Usually this will provide us with a projector P which 
describes adequately the state of a system of charge —eN interacting with Dirac's 
vacuum. It is formally solution of the following nonlinear equation 

(115) P = X(-oo,„] (r>i + - v) * A - a {P - 1/2)i *' y) ) , 
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fj, being a Lagrange multiplier due to the charge constraint. If v is not too strong, 
the mean-field operator 

P := Di + a(p [P _ 1/2] - u )*j-- a j— — -j 

will have exactly iV eigenvalues in [0, fj] (counted with their multiplicity). Therefore, 
there will be a natural decomposition 

N 

P = X(-oo,Q) P) + X[0,n] P) = X(-oo,0) iP) +^2<Pi®<Pi 

1=1 

where the tpi's are solutions of 

(116) T><p k =£ k cp kl e k > 0, 

which is the natural decomposition between the real and the virtual particles men- 
tioned above. It will be shown below that Equations (|116p are very close to the 
Dirac-Fock equations (|93p. 

The goal is now be to give a mathematical meaning to this program. 

First, one has to face an important difficulty: when P is an orthogonal projector 
and Sj is infinite dimensional, then P — 1/2 is never compact. Therefore, none of 
the terms of (|113j) makes sense. In [86], Hainzl, Lewin and Solovej tackled this 
problem in the following way: they considered a box Cl '■= [-L/2, L/2) 3 of volume 
L 3 and replaced the ambient space Sj by the space Lp er (Ci) of L 2 functions on Cl 
with periodic boundary conditions, with moreover a cut-off in Fourier space. Doing 
so, the problem becomes finite dimensional and everything makes perfect sense. 
Then, when no charge constraint is imposed, they took the thermodynamic limit 
L — > oo with the Fourier cut-off fixed and proved that the sequence of minimizcrs 
converges to a limiting projector P satisfying equation (|114|) . In this way, they 
could give a meaning to the minimization of the QED functional (|112p and to the 
equation (|114|) . Moreover, they also obtained the so-called Bogoliubov-Dirac-Fock 
(BDF) energy which attains its minimum at the limit state P. This energy has been 
studied by Hainzl, Lewin and Sere in [EH [84j [85] . In the following, we describe all 
these results in more detail. 

Notice that the Fourier cut-off will not be removed in this study. Indeed it is 
well-known that QED contains important divergences which are difficult to remove. 
Therefore 5) will be replaced by the functional space 

fi A := {/ € L 2 (M 3 ,C 4 ), supp(/) C 5(0, A)} . 

The ideas described in this section have recently been adapted [30] to a nonlinear 
model of solid state physics describing nonrelativistic electrons in a crystal with a 
defect. 

4.1. Definition of the free vacuum. We start by explaining how the free vacuum 
was constructed in [86], in the case v = 0. This is done by defining the energy (|113p 
for projectors P acting on the finite-dimensional space 

£a := {/ G L 2 pcr ([-L/2,L/2) 3 1 C 4 ), supp(/) C 5(0, A)} 
= span {exp(ifc -x), ke (2tt/L)Z 3 n 5(0, A)} . 
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To define the energy properly, it is necessary to periodize the Coulomb potential 
as follows: 

( \ 



Wl(x) = ^ 



\ ke(27r)Z- 1 /L I 
\ fe^O / 



fc#0 

where w is some constant which is chosen such that minc L Wt — for any L. The 
Dirac operator D\ is also easily defined on it is just the multiplication of the 
Fourier coefficients by (-Di(fc))fc£2-n-z 3 /z,- Then, one introduces 

(117) £g(T):=tr j5i (£> 1 r) + +! [[ W L (x - y)p r (x)p r (y) dx dy 



ClxC l 



W L {x-y)\T{x,y)\ z dxdy, 

C L xC L 



for any self-adjoint operator T acting on Sjj^. The kernel T(x,y) of T is easily 
defined since f)\ is finite-dimensional. Its density pr is then defined as pr(x) := 
ti£i(T(x,x)). A translation-invariant operator T acting on is by definition a 
multiplication operator in the Fourier domain. In this case, one has T{x,y) = 
f(x — y) for some / and therefore px is constant. The identity of S)j(, denoted by 
ifc is an example of a translation-invariant operator. 
The first result proved in [SB] is the following: 

Theorem 29. (QED mean-field minimizer in a box [5S]) Assume that < a < 4/tt, 
A > and that L is large enough. Then the functional £ 1 has a unigue minimizer 
T Q L on the convex set 

Gl ■.= {r e C(f&), v* - r, -7^/2 < r < i£/2} . 

It is invariant by translation and satisfies p r o = 0. Moreover, it takes the form 
= — I\/2 where "P£ is an orthogonal projector on 

Notice that in the definition of the variational set Q](, we did not consider only 
operators taking the form P—lfc /2 , where P is an orthogonal projector as suggested 
by (|112p . but we indeed extended the energy functional to the convex hull of this 
set. But, as usual in Hartree-Fock type theories |113j . the global minimizer is always 
an extremal point, i.e. a state taking the form P — lJ(/2. 

Of course, it can easily be shown that V^ L satisfies an equation similar to (|114p 
with v removed and l/|x| replaced by Wl- But we do not give the details since we 
are more interested in the limit of as L — > oo. 



To be able to state the thermodynamic limit correctly, one needs first to introduce 
the translation-invariant projector acting on Sja, which will be the limit of the 
sequence (V°)l- The identity of S)\ is denoted by I\. We introduce 
(118) 

T(A) = 1 / tr c 4D°( P )A(j>)] dP " TOT // ^f^ dpdg 

for any A belonging to the convex set 

Aa '■= {A translation-invariant on f)\, A* — A, —I\/2 < A < /a/2} . 
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It will be shown in Theorem[3T] below that T represents the energy per unit volume 
of translation-invariant operators. For this reason, one now considers the minimiza- 
tion of T on Aa • The following was proved in [55] : 

Theorem 30. (Definition of the free vacuum [86] ) Assume that < a < A/tt and 
A > 0. Then T possesses a unique global minimizer T° on Aa- It satisfies the 
self- consistent equation 

sgn(2?°) 



(119) 



r° = -- 



f - y\ 

or, written in terms of the translation-invariant projector V_ = T n + I\/2, 

(120) V°_ = X( -oo,o) ■ 
Moreover, T>° takes the special form, in the Fourier domain, 

(121) V°(p)= gi (\p\) a -p + g Q (\p\)P 

where go, g\ € L°°([0, A),R) are such that 1 < gi(x) < go(x) for any x £ [0, A), 
and therefore 

(122) | J D 1 (p)| 2 <|P (p)| 2 <.g (H)|^ 1 (p)| 2 . 

The self-consistent equation (|119[) has already been solved by Lieb and Siedentop 
in a different context [116] . They used a fixed point method only valid when 
a log A < C for some constant C. 

As shown by the next result, the negative spectral projector "P° of the Dirac- 
type operator T>° represents the free vacuum, as it is the limit of the sequence 
■p£ when L — > oo. An important property of T° showing the usefulness of the 
subtraction of half the identity in (|112p is the following. Due to 

v°_( P ) - j a(p )/2 = r» = - g^p^W , 

one infers 

trc(r°(p)) = tr c 4(V°_ - J A /2)(p)] - 0, 
for any p £ _B(0, A), the Pauli matrices being trace-less. This has the important 
consequence that the (constant) density of charge of the free vacuum vanishes: 

pro = (2tt)- 3 / tr c 4T°(p))dp = 0, 
Jb(q,a) 

which is physically meaningful. This formally means that 

(123) "tr (V° -Ik/2) = [ p r odx = 0\ 

Jr 3 

We notice that V®_ is not Dirac's original choice P° (except when a = 0) because 
the interaction between the particles (the virtual and the real ones) is taken into 
account by the model. Notice also that Equation (|120[) is exactly the same as (|114[) 
with v = 0, due to (TlT!)|) . 

As a consequence of (|12ip . the spectrum of T>° is 

a(©°) = {±V5o(bl) 2 +5i(bl) 2 b| 2 , P£B(0,A)}. 
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It has a gap which is greater than the one of D°, by (|122j) : 

(124) 1 <m{a) :=min a (\V°\). 

In |85j . it is proved that when a <C 1, then 771(a) = go(0) and conjectured this is 
true for any < a < A/tt. Notice that the following expansion is known [116[ 186] : 
go (0) = 1 + a arcsinh(A) + 0(a 2 ). 

FIGURE 2. The free vacuum V*L is the negative spectral projector 
of the translation-invariant operator T> . 

We are now able to state the thermodynamic limit, as proved in |86j 

Theorem 31. (Thermodynamic limit in the free case [86 ) Assume that < a < 
4/ V and A > 0. Then, one has 

lim = minT, 

L— >oo L J -Aa 

where we recall that is t/ie unique minimizer of defined in Theorem \29l 
Moreover, V Q L = T° L + 1% /2 converges to V?_ in the following sense: 

lim \\V° L - V Q \\ & { L = lim sup \V° l (p) - V-(p)\ = 0. 

4.2. The Bogoliubov-Dirac-Fock model. Now that the free vacuum V®_ has 
been correctly defined, we will be able to introduce the Bogoliubov-Dirac-Fock 
(BDF) energy as studied in [83l [M 13 IS] • Formally, it measures the energy (|112[) 
of a state P, relatively to the (infinite) energy of the free vacuum Vt . It will only 
depend on P — V°_ and reads formally: 

U £BDF(P-'P -) = ^QED^-^QED^-) 

(125) = £"(P-I A /2) ~£°(V°_ -Ik/2) 

PQ(x)v{y) 



tr (TfiQ) all 

J JR2 



V 



-dx dy 



a 



(126) P -^^ld X dy-Uf ^f^", 

2 JJr3 X r3 \x-y\ 2 JJ R i xR 3 \x-y\ 

with Q = P — . This new energy looks again like a Hartree-Fock type functional 
except that our main variable is Q = P—V^_, which measures the difference between 
our state and the (physically unobservable) translation-invariant free vacuum V®_ . 

The energy f|126[) was introduced and studied by Chaix-Iracane in [36] (see also 
Chaix-Iracane-Lions |37j). An adequate mathematical formalism was then provided 
by Bach, Barbaroux, Helffer and Siedentop [6] in the free case v = 0, and by Hainzl, 
Lewin and Sere (83] [84] in the external field case v ^ 0. However, in all these works 
a simplified version was considered: 2?° and were replaced by Dirac's choice D\ 
and P2_. As mentioned in [86], although the choice of for the free vacuum is 
better physically, the two models are essentially the same from the mathematical 
point of view: the main results of [U [83l [84] can be easily generalized to treat the 
model in which T>° and V°_ are used. 
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What is gained with ()126|) is that Q can now be a compact operator (it will 
be Hilbcrt-Schmidt, indeed) and, thanks to the Fourier cut-off A, many of the 
terms in (|126[) will be mathematically well-defined. However, it will be necessary 
to generalize the trace functional to define correctly the kinetic energy tr (T>°Q). 
This is done by introducing the following space 

Sffe) := {Q £ 6 2 (£a) | V°_QV°_, V\QV\ £ 6i(£a)} , 
with the the usual notation 

6 p (£a) := {A £ C{Si A ), tr(\A\*>) < oo}. 

An operator Q belonging to & 1 ~ (S)a) is said to be V^-trace class. For any such 

■p 

Q £ &i ~ (Sja), we then define its V°_-trace as 

tr P o (Q) := tr{V°_QV°_) + tr(V° + QV° + ). 

Due to the fact that the free vacuum has a vanishing charge (|123|) . tr-po (Q) can be 
interpreted as the charge of our state P = Q + V n _ . We refer to [83] for interesting 
general properties of spaces of the form <3f (H) for any projector P' and infinite- 
dimensional Hilbert space H. 

Thanks to the cut-off in Fourier space, the charge density pq of an operator 

Q £ ©f- is well-defined in L 2 (M 3 ,M), via 

pq(k) = (2^)~ 3 / 2 / +fc/2| < L tr C 4 (Q(p + k/2,p-k/2)) dp. 

\p-k/2\<L 

Finally, the following notation is introduced 

D(f,g)= 4tt J dk, 

for any (f,g) £ i 2 (M 3 ,M) 2 , which coincides with // K 3 xR 3 f(x)g(y)\x - y^dxdy 
when / and g are smooth enough. 

It is possible now to define the Bogoliubov-Dirac-Fock energy as j83l HH [86] 
(127) 

£^ F (Q)--=^(V°Q)-aD(p Q ,v) + ^D(p Q , PQ )-^ ff lQ{x ' y) l 2 dxdy, 

2 2 JJ R 6 \x-y\ 

where 

(128) Q £ Q A := |q £ 6f(%), -V° < Q < . 

It is proved in [85 that any Q £ & 1 (S)a) automatically has its density pq in the 
following so-called Coulomb space: 

(129) C = {f \ D(f, /) < oo} , 

which is the natural space for defining the terms depending on pq in (|127p . Notice 
that the set Qa is one more time the convex hull of our initial states P — where 
P is an orthogonal projector on ^a. The following was proved: 
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Theorem 32 (The BDF energy is bounded-below [33 [83l O [86] ) . Assume that 
< a < 4/tt, A > and that v £ C. 

(i) One has 

VQgQa, E£ df (Q) + ^D(v,u)>0, 
and therefore £ g DF is bounded from below on Qa . 

(ii) If moreover v — 0, then £g DF is non-negative on Qa, being its unique mini- 
mizer. 

The boundedness from below of the BDF energy is an essential feature of the 
theory. It shows the usefulness of the inclusion of the vacuum effects in the model. 
The interpretation of (ii) is the following: by (|125p . it proves that the free vacuum 
V°_ is the unique minimizcr of the (formal) QED energy in the set of all the projec- 
tors P which are such that P — V°_ E Qa- In the previous subsection (Theorem IBT)]) . 
it was also proved that V°_ is the unique minimizer of the energy per unit volume 
T. These are two different ways of giving a mathematical meaning to the fact that 
is the unique minimizer of the QED energy when no external field is present. 

The case v = 0, (ii) in Theorem [321 was proved by Bach, Barbaroux, Helffer 
and Siedentop 6 1. In this paper, the authors also study a relativistic model, but 
with vacuum polarization neglected (see Section 1431 below) . They were inspired of 
a paper by Chaix, Iracane and Lions [37] ■ Then, it has been argued in [83l l84l [86] 
that the proof of the case v ^ 0, (i) in Theorem [32j is a trivial adaptation of [6]. 

Now that £b DF nas been shown to be bounded-below, it is natural to try to 
minimize it. Actually, we shall be interested in two minimization problems. The 
first is the global minimization of £g DF in the whole set Qa- As mentioned above, 
a global minimizer of the mean-field QED energy S q ED and therefore of the BDF 
energy £q D f (they formally differ by an infinite constant !) is interpreted as the 
polarized vacuum in the external electrostatic field —civ * | ■ If one wants to 
describe a system of charge —eN, one has to minimize £g DF in the A^th charge 
sector: 

Q A (N) := {q e Qa | tx v o_ (Q) = AT j . 

These two minimization problems will be tackled in the following two subsections. 

Let us also notice that the boundedness from below of the BDF energy (| 12T[) has 
been used by Hainzl, Lewin and Sparber in |88] to prove the existence of global-in- 
time solutions to the time-dependent nonlinear equation associated with the BDF 
functional: 

(130) ij t P(t) = [P^, , P(t)] 

where [•, •] is the usual commutator and 

V Q :=V + a(p Q -!/)*__ a- 

I- 1 \x-y\ 

(indeed, like in [83] [84], the model for which V°_ and T>° are replaced by P° and D\ 
was considered, but the proof holds similarly in the case of (|130l) ). Global solutions 
are shown to exist for any initial orthogonal projector 

p(o) &v°_ + e 2 (sj A ). 
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For nonlinear Dirac theories, it is usually quite difficult to prove the existence 
of global solutions (see the comments in Section ll.6|l . Here, the proof is highly 
simplified by the important property that the BDF energy £ g DF , which is conserved 
along solutions of (|130[) . is bounded-below and coercive. 

Let us now give the proof of Theorem [32l which is a simple adaptation of argu- 
ments of [6]. 



Proof of Theorem 1321 Let be Q <E Q\, and therefore satisfying the operator 
inequality 

(131) -V°<Q<Vl- 
It is easily proved that (|13 1[) is equivalent to 

(132) Q 2 <Q++ -Q— 

where Q++ := V\QV% and Q — := V°_QV°_. This now implies that 

(133) < tr(|X>°|Q 2 ) < tr P o (V°Q) 

which shows that the kinetic energy is non-negative. We now use Kato's inequality 
(|44| and Equation ([122")) to obtain 

ft \Q^V)? dxdy < JtrdAlQ 2 ) < 1 tr(\V°\Q 2 ). 

JJr^xm 3 \x-y\ 2 2 

Together with (I133|) . this implies 

^df(Q) > (l - cl J) tr(|2?°|Q 2 ) - ^D{y, „), 
which easily ends the proof of Theorem [321 n 



4.3. Global minimization of £b DF : the polarized vacuum. The existence of 
a global minimizcr of £ g DF has been proved by Hainzl, Lewin and Sere, first in [83] 
by a fixed-point argument valid only when a^/\ogK < C\ and aD(v, v) 1 / 2 < C%, 
and then by a global minimization procedure in [84 , valid for any cut-off A and 
< a < 4/ ir. The precise statement of the latter is the following: 

Theorem 33 (Definition of the polarized vacuum [531 [531 [86 ). Assume that < 
a < 4/-7T, A > and that v € C. Then £^ F possesses a minimizer Q on Q\ such 
that V = Q + V_ is an orthogonal projector satisfying the self-consistent equation 

(134) V = x ( -oo,o) (Vq) , 

(135) V Q = V« + a{ PQ - V )*±-a Q ^ y) 



(136) = D 1 +a (p[p__i /2 ] - * p 

Additionally, if a and v satisfy 



\x - y\ 

(V--l/2)(x,y) 



\x - y\ 
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then this global minimizer Q is unique and the associated polarized vacuum is neu- 
tral, i.e. Q € Qa(0): 

(138) tr v o_ (Q) = tr v o_ (V- - V Q _ ) = 0. 

The proof consists in showing that £ g DF is lower semi-continuous for the weak-* 
topology of Qa- For this purpose, one shows that, in the electron-positron held, 
any mass escaping to infinity takes away a positive energy. This is the so-called di- 
chotomy case of the concentration-compactness principle |118j . To prove (|138[) , one 
first shows that tr-po (V- —V^_) is always an integer, then one applies a continuation 
argument. 

Notice that the definition (|119p of T>° has been used to obtain (|136p from (|135p . 
Of course, equations (|134|) and (|136[) are exactly the one we wanted to solve in 
the beginning (| 1 14|) . For not too strong external densities v, a neutral vacuum 
is necessarily obtained, as shown by (|138|) . But in general, a charged polarized 
vacuum could be found. 



-O — n— or n — i 



V 

Figure 3. The polarized vacuum V in the presence of the exter- 
nal density v is the negative spectral projector of the mean-field 
operator Vq. 



In [SB], a thermodynamic limit was considered as for the free case v = 0, to 
justify the formal computation (|126p when v ^ 0. As before, the QED energy is 
well-defined in a box Cl = [— L/2,L/2) 3 with periodic boundary conditions and a 
cut-off in Fourier space by 

(139) £ V L {T) := 8° L (T) - a [[ W L (x - y)p T {x)v L {y) dx dy, 

J Jc L xC L 

where 

(140) Mx) = ^l J2 Hky k -*, 

ke(2irZ 3 )/L 

(for simplicity, it is assumed that v is a smooth function) . Then the following was 
proved in [86] . 

Theorem 34 (Thermodynamic limit with external field [86 ). Assume that < 
a < 4/7T, A > 0, v £ C and that v is continuous on B(0,A). Then for any L, 
possesses a minimizer T L =V L - on Q\ where V L is an orthogonal projector, 

and one has 

(141) lim {£1{T L ) - £° L (r° L )} = min{^ DF (Q), Q € Q A } . 

L — >oo 

Moreover, up to a subsequence, Ql(x, y) ■= (Tl — ^l)( x ' V) = (Pl — T- > l)( x 'V) 
converges uniformly on compact subsets ofM 6 to Q(x,y), a minimizer o/ £g DF on 

Qa- 

Since all the previous results hold for any fixed A, it would be natural to consider 
the limit A — > oo. This was done in [84] where it is argued that this limit is highly 
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unphysical: the vacuum polarization density totally cancels the external density 
v. In QED, this "nullification" of the theory as the cut-off A diverges was first 
suggested by Landau [107j and later studied by Pomeranchuk et al. [133] . 

Notice that in QED, the procedure of renormalization is often used to formally 
remove the cut-off A and the divergence of the theory. It consists in assuming 
that the parameter a is not the physical one but the bare one. The physical 
ttphys — 1/137 is related to a by a formula of the form 



When ctphys is fixed at its physical value, then necessarily A < 10 280 , meaning 
that the large A limit should not be considered in principle. In the case where the 
exchange term is neglected in the energy (the so-called reduced BDF model), this 
renormalization procedure has been studied in detail in [84] . 

4.4. Minimization of £ g DF in charge sectors. The previous subsection was de- 
voted to the global minimization of the BDF energy. We now mention some results 
that have been obtained in [85j for the minimization with a charge constraint. It 
is believed that the charge constrained BDF model can be obtained as the thermo- 
dynamical limit of the full QED model in a fixed charge sector and posed in a box 
with periodic boundary conditions, but this has not been shown yet. 

Due to the charge constraint and like for the Hartree-Fock model for instance, 
minimizcrs will not always exist for the BDF functional: it depends whether the 
external electrostatic potential created by the charge distribution v is strong enough 
to be able to bind the N particles in the presence of the Dirac sea. On the other 
hand, it must not be too strong otherwise electron-positron pairs could appear. 

We start with a general result proved in [85 providing the form of a minimizer, if 
it exists. To this end, we introduce the minimum energy in the TVth charge sector: 



In principle N could be any real number, but here, for simplicity, we shall restrict 
ourselves to integers. 

Theorem 35 (Self-Consistent Equation of a BDF Minimizer [55]). Let be < a < 

4/7T, A > 0, v € C and AfeZ. Then any minimizer Q solution of the minimization 
problem (|142[) . if it exists, takes the form Q = P — V°_ where 



for some fi G [— m(a), m{a)\ . 

Recall that m(a) is the threshold of the free operator T>° defined in (|124|) . We 
remark that (|143|) implicitly means that the last eigenvalue below /j, of the mean-field 
operator Dq is necessarily totally filled. As already mentioned in the Dirac-Fock 
case, this is a general fact for Hartree-Fock type theories [7j. For a minimizer of 



the form (|143|) and when N, /i > 0, it is natural to consider the decomposition 

p = u + XI o^]CDq), 

where n is the polarized Dirac sea: 

n := x(-oo,o)(£>q)- 



a 



C^phys — 



l + 2alogA/(37r)' 



(142) 
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For not too strong external potentials, the vacuum will be neutral, i.e. 

tr 7 ,o(II-7'°) = 1 
and therefore X[o,/j](^q) wm be a projector of rank N: 

N 
71=1 

where £1 < • • • < £at are the first N positive eigenvalues of T>q counted with their 
multiplicity. Notice that 

(144) V Q = D 1 + a(p* - v) * - 

H \x-y\ 

1 (U-I A /2)(x,y) 
+ ap [n -/ A /2] * |7| - a ^ , 

where 

N N 

7* :=X[o,/i](fQ) = X^ n ®^*' :=tr C 4(7*(af,af)) = ^2\f n {x)\ 2 . 

n—l n—1 

In the first line of 1(1440 . the Dirac-Fock operator associated with (<^i, ^jjy) ap- 
pears, see (|95"1) . This shows that the electronic orbitals y>j are solutions of a Dirac- 
Fock type equation in which the mean-field operator T>$> is perturbed by the (self- 
consistent) potentials of the Dirac sea II — /a/2. In practice, these potentials are 
small, and the DF equations are a good approximation of the BDF equations for 
the electronic orbitals. But the energy functionals behave in a completely different 
way: as we have seen, the DF energy is strongly indefinite while the BDF energy 
is bounded below. The Dirac-Fock model is thus interpreted as a non-variational 
approximation of the mean-field model of no-photon QED [36j . 
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Figure 4. Decomposition of the system 'vacuum + TV electrons' 
for the solution P = II + 7$ in the Nth charge sector. 



Concerning the existence of a minimizer, solution of (|143|) . the following result 
was proved in [85] : 

Theorem 36 (Binding Conditions and Existence of a BDF Minimizer [85 ). Let 
be < a < 4/-7T, A > 0, v G C and N € Z. Then the following two assertions are 
equivalent: 

(H-l) E v (N) < min {E V {N - K) + E°(K), K E Z \ {0}} . 

(H2) Each minimizing sequence (Q n )n>i for E V (N) is precompact in Qa and con- 
verges, up to a subsequence, to a minimizer Q of E V (N). 
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Conditions like (Hi) appear classically when analyzing the compactness proper- 
ties of minimizing sequences, for instance by using the concentration-compactness 
principle of P.-L. Lions |118j . They are also very classical for linear models in which 
the bottom of the essential spectrum has the form of the minimum in the right hand 
side of (Hi), as expressed by the HVZ Theorem [95l 11631 [171] . Assume N > for 
simplicity. When < K < N, (Hi) means that it is not favorable to let K electrons 
escape to infinity, while keeping N — K electrons near the nuclei. When K < 0, it 
means that it is not favorable to let \K\ positrons escape to infinity, while keeping 
N + \K\ electrons near the nuclei. When K > N, it means that it is not favorable 
to let K electrons escape to infinity, while keeping K — N positrons near the nuclei. 
When a is small enough and N > 0, it was shown in [85] that the separation of 
electron-positron pairs is not energetically favorable, so that one just needs to check 
(Hi) for K = 1,2,.. .,7V. 

To prove the existence of a minimizer, one can therefore prove that (Hi) holds. 
Two situations in which (Hi) is true have been provided by Hainzl, Lewin and Sere 
in [85] . The first one is the case of weak coupling a <§; 1 and av = v fixed (the 
charge N is also fixed). The following was proved: 

Theorem 37 (Existence of a minimizer in the weak coupling limit 85J). Assume 
that A > 0, that N is a non-negative integer, and that v S C is such that 

(1) the spectrum a(D° — D* | • contains at least N positive eigenvalues below 
1, 

(2) ker(D° -tv*\- \^) = {0} for any t E [0, 1]. 

Then (Hi) holds in Theorem \ 36\ for a small enough and av = D, and therefore 
there exists a minimizer Q a of E v l a (N) . It takes the form 

N 

(145) Q a - X(-oo,o] (DqJ -V°_ +x ( o,^] (P Qa ) := QT + £ 

i=l 

(146) V Qa v?=s?tf 

where s" < ■ ■ ■ < are the N first positive eigenvalues of T>Q a . Finally, for 
any sequence a n — > 0, Vat") converges (up to a subsequence) in 9)a to 

(cpi, ippf) which are N first eigenfunctions of D° — f> =+= | - | 1 and Q™ c converges 
to X(-oo i( ,) (D°-V*\- r 1 ) - P Q _ in 6 2 («a). 

The second situation provided in [85] is the case of the non-relativistic regime 
c ^> 1. To state the result correctly, we reintroduce the speed of light c in the 
model (of course, we shall then take a = 1). The expression of the energy and 
the definition of the free vacuum (which of course then depends on c and the 
ultraviolet cut-off A) are straightforward. We denote by E v a c j i (N) the minimum 
energy of the BDF functional depending on the parameters (a, c, A). The following 
was proved: 

Theorem 38 (Existence of a minimizer in the non-relativistic limit 85 ). Assume 
that a = 1 and that the ultraviolet cut-off is A = Aqc for some fixed Aq. Let be 
v 6 C H L 1 (R 3 ,R + ) with L 3 v — Z, and N a positive integer which is such that 
Z > N—l. Then, fore large enough, (Hi) holds in Theorem\3Bi and therefore there 
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exists a minimizer Q c for E\ c \ oC (N). It takes the following form: 

N 

Qc = x ( -oo,o) (p Q ) -v°_ + xio,„ c ) (Vq) = QT C + £ M) m , 

i=i 

and one has 

km {KcAoc(^) - N 9o(0)} = mm S HF ($). 

c—oo ' " u •S>£H 1 (M 3 _,C 2 ) N 

Gram <&=1 

Moreover, for any sequence c n —* oo, (<fii n , V'jv ) converges in H 1 (M. 3 , <C 4 ) N (up 
to a subsequence) towards ((fx, (fisr) with ipi — (q'), and where $o = (<Pi, <Pn) 
is a global minimizer of the Hartree-Fock energy. 

We notice that this result is very similar to Theorem[2H] providing the convergence 
of the Dirac-Fock 'ground state' in the non-relativistic limit. 

4.5. Neglecting Vacuum Polarization: Mittleman's conjecture. In view of 
the complications introduced by the Dirac sea, some authors [5J O [T3] considered 
an approximate model in which the vacuum polarization is neglected, in the spirit 
of a paper by Mittleman [125] . In this subsection, we shall keep the speed of light 
c and therefore take a = 1. 

In the vacuum case, a possible way to describe Mittleman's approach is first to 
write that the global minimizer V —1/2 constructed in Theorem 1551 is formally a 
solution of the following tautological max-min principle: 

(147) e = sup inf {^ ED (P + 7 ) - «^ ED (P)} . 

p2 P =p -P<J<1-P 

Indeed taking 7 = 0, one finds that e < 0. Saying that V is a global QED minimizer 
exactly means that 

mm {^ed(P + 7) - ^qed(P)} = 0. 

Then, the idea is to approximate (|147[) by neglecting the vacuum polarization terms 
coming from P. One has formally 

(148) r QED (P + 7) - ^ ED (P) = tr(Z? P 7) - // ^ M f } dx dy 

JJr3xr3 \x-y\ 

iff p^)p,(y) dxdy _l [[ \^y)\\ xdy 



,p._ n , ... 1 (P-l/2)(x,y) 



2 J Jr 3 xR 3 \x-y\ 2JJ R 3 XR 3 \x-y\ 

with 

:= D c + P[ P -i/2] * , 

Neglecting the vacuum polarization potentials then simply amounts to replacing 
D p by the free Dirac operator D c . The following max-min principle was studied 
by Bach, Barbaroux, Helffer and Siedentop in [6], inspired by Mittleman |125j : 

(149) e^; tt = sup inf £p c ( 7 ) 

P. i£6i(Sa), 
P 2 =P=P* -P<7<l-P 
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where Sp C is defined as 



(150) ££ c ( 7 ) := tr(£> c7 ) - D{u, Pl ) + \d( Pi , Pi ) - \ [[ 

1 z J JR 




dx dy 



on the set depending on P: 

T{P) := { 7 e 6i(£), |V| 7 6 ©i(J5), -P < 7 < 1 - P}- 



The advantage of this formulation is that, since the vacuum polarization has been 
neglected, no divergence problem is encountered and (|149p can be studied without 
any Fourier cut-off (i.e. Sj — if 1 / 2 (R 3 , C 4 )), and with pointwise external charges 
(i.e. v = ZSq where <5o is the Dirac distribution at the point € ]R 3 ). The following 
was proved in [6]: 

Theorem 39 (Mittleman Principle for the Vacuum [5]). Assume that v = ZSq for 
some Z > and that c > satisfies 4c(l — 2Z/c)/tt > 1. Then 



where S denotes the set of all the orthogonal projectors P which are such that P 
and 1 — P leave the domain of D c — Z/\x\ invariant. Moreover, e^j tt = 0. 

As a consequence, when vacuum polarization is neglected, the Dirac sea is rep- 
resented by the negative spectral projector of the operator D c — Z/\x\. 

The N electron case was studied by Mittleman in |125j . His main idea was to 
justify the validity of the Dirac- Fock approximation by a type of max-min principle 
from QED in which vacuum polarization is neglected. Let us introduce the following 
max-min principle: 



The interpretation is that 7 represents the N electrons, whereas P is the Dirac sea. 
Notice that in this interpretation, the real particles are artificially separated from 
the virtual electrons of the Dirac sea. However, it was believed that (|152[) could be 
a simpler problem with interesting practical implications. Remark that compared 
to (|149p . we have added in (|152[) the condition that 7 commutes with P, i.e. it 
cannot contain off-diagonal terms. Without this requirement, it is known [3] that 
(|152|) cannot give any solution of the Dirac-Fock equations. 

Mittleman's conjecture consists in saying that the max-min principle (|152|) is 
attained by a solution to the Dirac-Fock system. More precisely, we follow [T3] and 
state it as: 

Mittleman's conjecture. A solution of (|152jl is given by a pair 



where $ is a solution of the Dirac-Fock equation D c ^ipi = Ei^pi with Si > 0, and 
where -D Cj $ is the Dirac-Fock mean-field operator defined in (|95[) . 

Mittleman's conjecture for molecules has been investigated in [14j[T3]. In [14] 
Barbaroux, Farkas, Helffer and Siedentop have studied the minimization problem 




(151) e ^; tt = sup inf ££ c ( 7 ), 

pes 7er p 



(152) 




(A 7) = (X(-oo,0)(A:,*),7<f>) 
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on T(P) in (|152|) for a fixed P. Under suitable conditions, they proved that 7 is an 
electronic density matrix 7 = J2i=i \ l Pi)( l Pi\> where ((pi, ipx) is a solution of the 
projected Dirac-Fock equations (1 — P)D C ^(1 — P)tpi = Siipi. Then they further 
proved that if the energy is stationary with respect to variations of the projector 
P, then the unprojected Dirac-Fock are obtained. But they were unable to prove 
the existence of such a state. 

In [13], Barbaroux, Esteban and Sere investigated the validity of Mittleman's 
conjecture by a perturbation argument. Namely they added a parameter r in front 
of the interaction terms as follows: 

(153) £JS^( 7 ) := tr(£> c7 ) - D{v, Pl ) + ^D(p 7 ,p 7 ) - ~ // ^} X ' V ^ dxdy. 

2 2 77 R6 \x-y\ 

Here v is a fixed positive and smooth radial function with compact support and 
J R3 v = 1. They studied Mittleman's conjecture in the regime c> 1 and r -C 1. We 
emphasize that by a scaling argument this physically corresponds to assuming a <C 
1 and Z ^> 1 with aZ <C 1. We denote by £^p' T the Dirac-Fock functional which 
is easily defined with these parameters, and by $ c ' r '° the DF solution obtained by 
the corresponding versions of Theorems [23l and [27l for j — 0. 

When c > 1, it is known that D c — v* | • | _1 is essentially self-adjoint on L 2 (R 3 , C 4 ) 
and that its spectrum is as follows: 

a(Pc -v*\ ■ T 1 ) = (-oo,-c 2 ]U{^i(c) < p 2 {c)---}U [c 2 ,(5o), 

where limj-Kx, fXi(c) — c 2 . We denote by (iVj(c))j>i the multiplicities of the //i(c)'s. 
The following was proved in [13] : 

Theorem 40 (Validity and non validity of Mittleman's conjecture [13 J. Assume 
that v is a fixed positive and smooth radial function with compact support and such 
that L 3 v = 1 . 

If N = Si=i-^i( c ) f or some I > 1 and some /i:red c > 1, £/ien Mittleman's 
conjecture is true for r small enough: one has 

where $ C > T '° is any of the solutions obtained in Theorem \23\ in the case j = 0. 
Moreover, the optimal projector for the sup part of (|152[) is P = X(-oo.o)(^c,* c t )- 
If N = 53i=i N.i(c) + 1 /or some I > 1 and c Zarge enough, then Mittleman's con- 
jecture is wrong when r > is small enough: there is no solution $ 0/ t/ie Dirac- 
Fock equations with positive multipliers such that the pair (x(-oo,o)(-^c,#)>7*) re_ 
alizes Mittleman's max-min principle (|152p . 

One can consider a weaker version of Mittleman's conjecture which consists in 
only comparing energy levels and not the solutions themselves. 

Weaker Mittleman's conjecture. One has 

&N)=£^' T (^°). 

When N = 1 and for c 3> 1, t <C 1, it is known by Theorem |4TH that Mittleman's 
conjecture is wrong. But Barbaroux, Helffer and Siedentop proved in [15] that the 
weaker conjecture is indeed true. 
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Theorem 41 (Validity of the weaker Mittleman conjecture for N = 1 [15]). Assume 
that v = 5q. Then for c large enough and r small enough, the weaker Mittleman 
conjecture is true: 

The sup-inf (|152p is realized by the pair 

(P C)7 ) with P c = X{-oofi){Do-l/\x\), 7 = r¥>WI 

where (p c is any eigenvector of 

(154) D c -l/\x\ 

with eigenvalue ej(jj^ (TV). 

In [15] . an explicit condition on r and c is provided. Notice the equality 

D c ,^ c = D c , Q( p c = (D c - l/\x\W = e^ T t (N)v c 

which is very specific to the one-electron case, and means that the electron does 
not "see" its own Coulomb field. In particular P c ip c = 0. 
For N > 2, the question remains completely open. 
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